—— nnd рери: 


BETTER 
SAMPLING 
CONCEPTS | 


TECHNIQUES 
EVALUATION 


Y. P. Aggarwal 


BETTER SAMPLING 
Concepts. Techniques and Evaluation 


Le 
BETTER SAMPLING 


Concepts, Techniques and Evaluation 


Y.P. AGGARWAL 


. STERLING PUBLISHERS PRIVATE LIMITED 


STERLING PUBLISHERS PRIVATE LIMITED 
L-10, Green Park Extension, New Delhi - 110 016 
G-2, Cunningham Apartments, Cunningham Road, Bangalore - 560 052 


LUET. "west seugc: 


Date T FE 9 it : 2| 


& «o. Mo. ^ M. оди 


Better Sampling : Concepts, Techniques and Evaluation 
O 1988, Y.P. AGGARWAL 


All rights reserved. No part of this publication may be reproduced, stored 
in a retrieval system or transmitted, in any form or by any means, 
mechanical, photocopying, recording or otherwise, without prior written 
permission of the Publisher. 


PRINTED IN INDIA 


Published by S. K. Ghai, Managing Director, Sterling Publishers Pvt. Ltd. L-10, Green Park 
Extension, New Delhi - 110 016. Printed at Ram Printograph, New Delhi and Lasertypeset 
by ProntoPrint Pvt. Ltd., 48, Okhla Industrial Estate, New Delhi - 110 020. 


РЕЕЕАСЕ 


The book presents а non-mathematical yet comprehensive survey of 
the basic principles of sampling. It is an attempt to make available to the 
research scholars and teachers of various disciplines, a treatment of the 
subject which can be called intermediate between the very sophisticated 
and mathematical works like those of Cochran (1972) and Hansen, 
Hurwitz and Madow (1966) and the sketchy and inadequate treatment 
presented in the books on research methodology. While the former serve 
as text books for advanced courses in sampling theory and hence beyond 
the comprehension of average students of research in various social 
sciences, the latter are utterly inadequate for making the students 
sufficiently conversant with the techniques and their applications. 

The main emphasis, in this book, has been on the description and 
critical appraisal of the principles and methods of sampling and their 
application to various types of problems. Illustrations rather than 
formulas have been used to clarify the discussion and to aid the intuition. 
The illustrations have been taken from such diverse fields as education, 
psychology, sociology, business administration, commerce, economics, 
forestry and public administration. Hence the book is likely to be useful 
for students and teachers of research in these fields. A comprehensive 
bibliography has been given to aid the students in searching out further 
reading material. Tables of Random Numbers have been added and the 
method of their use explained fully with illustrations. 

The book first appeared under the title Sampling Methods for Social 
Investigation. The content since then has been thoroughly revised, 
enlarged and enriched. Several new sections have been added to make 
the book more useful. A number of diagrams have also been given to aid 


understanding. A scale for self-evaluation of sampling has been devised 
and incorporated to enable the researchers to havea self assessment of the 
sampling procedures used by them. In the evaluation of others' research 
reports also, this tool may be found of great help. A glossary of important 


terms has also been added. 
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FUNDAMENTAL NOTIONS OF SAMPLING 


THE MAIN purpose of this chapter is to describe the general principles 
involved in sampling. Very often our attitudes, our knowledge and our 
actions are based on samples. It applies equally to everyday life and to 
scientific research. A person's opinion of a bank, or a shop, or an 
institution is generally based on one or two encounters which he had with 
it in the course of several years of working with the former. A visitor's 
opinion about a country after spending a few days in it will be determined 
by his experiences of a few places he has seen and a few persons he has 
met. His conclusions about that country will differ from those of a social 
scientist who spends a number of years in studying the political and the 
social system of that country and uses the scientific procedures of 
investigation. Perhaps our visitor is less likely to be aware of the extent 
of his ignorance. Generally a vaguely formulated understanding of 
sampling is part and parcel of what is called common sense and is 
characteristic of everyday approach. However, this common place usage 
of the term is becoming less informal and unconscious. More and more 
the informed public, when confronted by the results of apollorsomeother 
conclusion based on sampling procedure, is asking pertinent questions. 
Who and what type of persons were interviewed ? How many were 
interviewed ? No longer are such unqualified statements as “Most of the 
subjects responded negatively,” or “over half the juvenile delinquents 
surveyed came from broken homes” taken as serious scientific 
propositions.' 

Now let us view а few more commonplace examples of sampling — 
the trader-examining a handful of grains from the bag, the tea-taster trying 
different brands of tea, a technician making a blood-test and a housewife 
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checking a few grains of rice from the pan to determine whether the entire 
lothas been properly cooked and become soft. All of them are employing 
the method of sampling. Their confidence in their judgements rests on ће 
fact that the material they are sampling is so well mixed or homogeneous 
that the few grains of wheat, a drop of blood, a few leaves of tea ora few 
grains of rice do adequately represent the whole. 


1.1 Population and Sample 

A sample is a miniature picture of the entire group or aggregate from 
which it has been taken. A Sample, in other words, is a smaller 
representation ofa larger whole. The entire group from which the sample 
has been taken is known as the “population”. The terms “universe” and 
“supply” are also used but are less popular. The term population, in 
research, is used in a broader sense than its commonplace meanings as a 
population of people. A population may consist of persons, objects, 
attributes, qualities, behaviours of people, answers to various items of a 
test, the behaviour of inanimate objects such as throws of dice or coins, 
Cities, families, opinions of the electorate of a nation and the like. A 
population is a well defined group of any of these. The definition should 
be explicit enough to permit anyone to Say with confidence that a 
particular object or person is in the defined population and another not. 
Defining a population means fixing the limits in terms of one or more of 
its various aspects. A few of them are described as follows: 


1. Geographical Limits : A study may be confined toa town, or village, 
a district, a province, a country or the whole world. The World Surveys 
of Education? conducted by UNESCO include all the countries of the 
world. 

2. Ageor Grade : А study may include children in the age group 8+ to 
10+, or іп grades 3 105. , 

3. Sexand Socio-Economic Status : The study may be confined to men 
or women only, or a particular socio-economic status group only. 

4. Physical Attributes : Just as weight, height, colour of eyes, a 
particular blood group and the like. 

5. Psycho-Social Behaviour : Delinquents, neurotic offenders, 
normals, hospitalized, schizophrenic, unsuccessful or successful 
candidates in a particular examination, all prize winners in a lottery, 
recipients of Vir-Chakra and the like. 
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The selection of one or more of these criteria in defining a population 
depends on the nature and scope of the research in hand. 

Kempthorne’ has distinguished between “experimentally accessible 
population" and "target population". The former is the population of 
subjects that is available to the researcher for his study. The target 
population is the total group of subjects about whom the researcher is 
empirically attempting to learn something. For example, a researcher has 
discovered a new approach to teaching map reading to the 8th class 
students. Probably he would like to conclude that his method was better 
for “all 8th class students in India", the target population. However, he 
randomly selects his sample from all "'8th class students in his state", the 
experimentally accessible population. The experimenter will have to 
make the following two jumps in his generalization : 


Target Population 


Experimentally 
Accessible Population 


Figure 1.1 Relatlonship of Sample and Target Population 
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1. Ајитрітот the sample to the experimentally accessible population, 
and 

2. Another jump from the experimentally accessible population to the 
target population. 

The first jump is justified on the basis of inferential statistics and does 
not present any problem if the sample has been selected randomly from 
the experimentally accessible population. 

In the above example, if the experimenter has selected his sample 
randomly from all 8th class students of India, the experimentally 
accessible population would have been very much like the larget 
population and inference could have been made with greater confidence. 
If the sample has not been randomly drawn from the accessible 
population, the generalization with probabilistic rigour would not be 
Possible. In such a situation, the sample becomes the experimentally 
accessible population. However, some authors like Cornfield and Тикеу“ 
advocate generalization from the sample to a population "like those 
Observed". 

The second jump, from the accessible population to the target 
population is relatively less justified and lacks rigour as compared to the 
first jump. The degree of statistical confidence with which an 
experimenter can generalize the target population is never known 
because the experimenter is never able to sample randomly from the true 
target population. 

For a further discussion of these concepts the reader is referred to 
Kempthorne’ and Bracht and Glass. The population may also be 
classified as "real", "artificial" and "hypothetical" populations. A real 
population is one which actually exists. Anartificial populationis created 
by the researcher in order to illustrate a principle, or to make for more 
convenience and ease in carrying out the study of a problematic situation. 
A hypothetical population is an artificial population devised purely on 
theoretical basis. This type of population enables the researcher to work 
with an infinite population of entities, when the real population under 
study is much smaller. “Simulation” is used for the creation of artificial 
populations and taking samples out of them. 

Populations may also be categorized as “known” and “unknown”, In 
a known population the frequency distribution and/or the parameters 
(Mean, Mode, Median, SD etc. ) are known. A population for which such 
estimates are not available may be termed as an “unknown population”. 
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However, in most practical research, the populations used are unknown 
and parameters are computed on random samples. 

In cases where the design of research allows for the inclusion of the 
whole population if it was accessible and also within manageable limits, 
sampling may not be resorted to and the study conducted on the entire 
group. However, in studies involving large populations, sampling 
provides a less expensive, more efficient and quicker method of data 
collection. 


1.2 Advantages of Sampling 
The following advantages generally accrue from using a sample rather 
than the entire population.” 
Reduced Cost : When data are collected only from a small fraction of the 
entire population, expenditures are smaller than if the entire group was 
studied. Surveys sometimes use as small a number as one person in one 
thousand. 
Greater Speed : The volume of the data to be collected will be smaller. 
Hence it can be collected, tabulated and summarized more quickly with 
asample than with the total population. In applied research where urgent 
answers to certain problems are needed, this aspect gets an added 
importance. 
Greater Scope : In studies where a complete enumeration and census of : 
allunits of population are impracticable and the research requires the use 
of highly trained personnel or specialized equipment, the choice may lie 
between collecting the information by sampling or abandoning the 
research itself. The use of the Rorschach, the TAT, the CAT and the 
Stanford-Binet are some of the instances in which sampling becomes all 
the more important. Thus surveys using sampling provide greater 
flexibility and scope. 
Greater Accuracy : With the reduction in the volume of work, personnel 
of higher expertise and training can be employed and a more careful 
supervision of the field work and processing of the data are possible. 
Hence sampling may produce results which are more accurate than those 
which could have been obtained through a complete census. Moreover, 
sampling is particularly more important in obtaining accurate results 
about phenomena which are undergoing rapid changes such as opinions 
about political and social issues. 

Мошу“ has aptly summarized the advantages of sampling in the 
following words : 
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Sampling is both necessary and advantageous. Taking a complete 
census is generally both costly and difficult ; in many cases it is 
completely impossible. What is not so clearly recognized by a 
layman, who feels that one takes a sample when he cannot get a 
complete census, is that sampling frequently results in more 
adequate data than a complete census. In an interview study, for 
example, sampling not only saves money but also permits greater 
care and control to be asserted; it allows for better training and 
coordination among the interviewers; it permits greater depth in 
interviewing; it allows the interviews to be conducted in a relatively 
short time so that the distorting effects of the passage of time are 
minimized; it also permits greater depth in analysis and greater 
accuracy in processing. 

Sampling is an essential part of all scientific procedure. It is not 
merely an adjunct of the social sciences. It is already well developed in 
biological researches in the field of agriculture. It is also a basic tool in 
other areas of biology, as well as in physics and chemistry. Science, 
committed as it is today to the variability of observations in all fields, is 
forced to sample from a universe of possible observations rather than 
believing that one observation provides an absolute and immutable truth. 


1.3 The Requirements of a Good Sample 
There are two basic requirements of a good sample — its 
representativeness and adequacy. 

If information from sample data is to be generalized to a population, 
itis essential that the sample should be representative of that population. 
Inthestrict sense of the term arepresentative sample would be a miniature 
or replica ideally in all respects of the population from which it has been 
drawn. It should apply at least to the characteristics directly under 
investigation or those likely to affect these characteristics indirectly, But 
if in order to check the representativeness of asample, the corresponding 
characteristics of the population would have to be known, there would, 
then, be no need to have a sample. Hence in practical use the researcher 
may seek a random sample rather than necessarily a perfectly 
representative one. Such a sample will fall within the range of random 
sampling errors, and will facilitate the estimation of the population 
parameter on the basis of probability theory. 

A good sample not only needs to be representative, it neéds also to 
be adequate or of sufficient size to allow confidence in the stability of its 
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characteristics. 

The research supervisors and advisorsare very often asked, “How big 
a sample do I need?” “Will 200 cases or 500 cases or 5 per cent of the 
population be sufficient for the study?" The researcher may feel 
disappointed when told that an accurate answer to his questions was not 
possible unless he himself provides a good deal of information relating 
to the study, the population it supposes to cover, the sampling design to 
be used, the designation of the parameters which he wishes to estimate, 
the magnitude or range of unreliability of estimates he is ready to accept, 
arough estimate of the dispersion of thecharacteristics unde investigation 
and the like. 5 

An adequate sample is one that contains enough cases to insure 
reliable results. Hence planning in advance for the size of the sample is 
very important. The procedure of determining the sample size required 
varies with the nature ofthe characteristic under study and its distribution. 
However, for the purpose of illustration, a case involving assumptions of 


. random samples bascd on normal probability distribution is given as 


follows : 

Suppose the researcher has decided (i) to use 95 per cent level of 
confidence, (ii) that for the purpose of study a 1 per cent error is 
acceptable; i.e., the sample mean does not differ from the population 
mean by more than 1 per cent , and (iii) has obtained preliminary estimate 
of the Standard Deviation of the population which is 10. He can, then use 
the formula meant for computing the Standard Error of Mean and through 
an inverse process work out the value of n or sample size. 

Formula for SE,, = с 


J n 
Value of standard deviate z at 95 per cent level of confidence=1.96 


Since the acceptable difference between the sample M and 
population џ is 1 per cent, the following equation is set up : 


1.00 


Ш 


1.96SE, 


1.96 Z 1.00 (substituting the formula given 
J n above) 


VA n = 1.96 с (Бу cross multiplication) 
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=. 


1.96 x 10 (Substituting the value of c ) 


Vn» = 19.6 


V n = 384.16 or 384 


Thus 384 cases are required to meet the condition ofa 1 percenterror 
at the 95 per cent confidence. If the researcher decides upon 1/ рег cent 
error and raises the confidence level to 99 per centaconsiderable increase 
in the sample size will be required. Solve : (2.58 SE (M) =.5). Generally, 
doubling the precision of a sample statistic calls for quadrupling the 
sample size. Another observation is also important to make. The size of 
n is also affected by the size of the standard deviation. The larger the 
standard deviation the larger the sample required. The smaller the 
deviation the smaller the sample size. Hence in the case of more 

“homogeneous populations, a relatively smaller size of the sample will be 
required. 

Parten’ has suggested a similar procedure for planning in advance for 
size of sample. The formula is 


N, = (с 2/1)? in which 

N, = The required sample size, 

с = а preliminary estimate of standard error units 
equal to a desired probability, and 

T = The permissible tolerance of variation in the 
sample mean. 

Z = The number of standard error units equal to 


desired probability. 


The main hindrance in the use of the above mentioned formulas is the 
question of obtaining a preliminary estimate of the population standard 
deviation. To solve this problem the following suggestions are offered for 
the benefit of the researchers. The population standard deviation may be 
obtained by 

1. Undertaking a pilot study, 
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2. Looking into the previous research on the subject for such 
information. 


3. Demming suggested the following formulas :. 


Value of SD in rectangular distributions = .29 h 
in Symmetrical distributions = 24h 
in skewed distributions = 21һ 


In which h = highest X - Lowest X; or the range. 


Hence the size of the sample is determined on the basis of variability 
of the population, the degree of precision required and the level of 
confidence at which the results are to be interpreted. However, the 
adequacy of the size of the sample does not automatically ensure accuracy 
of results. The sampling and measurement techniques ought to be 
selected and employed very carefully. 


1.4 Bias in Sampling 

A sample that is not representative is generally known as a “biased 
sample”. The very adjective “biased” connotes that the sample is not a 
good one or has been drawn with some prejudices and preferences in the 
researcher’ s mind or some of the units in the population have been unduly 
favoured in selection orothers wereata disadvantage. Y ule and Kendall'? 
have described the sources of bias in samples in the following words : 

Bias may be due to imperfect instruments, the personal qualities of 
the observer, defective technique, or other causes. Like experimental 
error, it is difficult to eliminate entirely, but usually may be reduced 
to relatively small dimensions by taking proper care — Experience 
has in fact shown that the human being is an extremely poor 
instrument for the conduct of arandom selection. Whenever there is 
any scope for personal choice or judgement on the part of the 
observer, bias is almost certain to creep in. 

Research in social sciences is replete with studies bearing the more 
superficial earmarks of erudition and authority that are intrinsically 
worthless and misleading for the sole reason that they are based on 
unrepresentative samples. Studies involving data collection through a 
mailed questionnaire often produce incomplete and distorted returns. A 
researcher may begin with the first list of the respondents which is a 
representative one but due to non-response, and the operation of other 
selective factors, he may end up with an extremely biased sample. 


LOER. Y., Won 
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It should not be inferred that all the ills and errors in social research 
are due to biased sampling. Many other sources of error are: the 
observation techniques, interviewing process, imperfection in the design 
of the questionnaire, tabulation plans and the processing of the material. 

Marks" argued in defence of the bias in the following words : 

Anotherarea of. serious misconception is the field of bias in sampling. 

The very term ‘bias’ suggests that itis undesirable— and it is. There 

are, however, conditions where it is better to use a biased estimate 

rather than accept the even more undesirable alternatives necessary 
to removing the bias. The criterion should be total error, which is 
composed of bias and variance. Ifavoiding a small bias means taking 

a very large variance, take the bias and keep the total error small. 

Mostof ourerrors of judgementare, however, in the reverse direction 

i.e., we strive for big samples (which means usually, small variances) 

and pay no attention to the sizes of the biases. The extremes of 

attending only to bias or attending only to variance are both 
undesirable, 

Cochran" has demonstrated the effect of bias onerrors of estimation. 
In the following diagram the two tails of the curveare affected different! y 
due to bias. 

Bias in statistical terms may be defined as : 

Bias or В = estimator (и) - true population value (и). 

Theamountof disturbance= B/o or the ratio of the bias to the standard 
deviation. It is generally accepted that the effect of the bias on the 
accuracy of the estimate is negligible if the bias is less than one-tenth of 
the SD of the estimate, 

As already stated - bias enters into sampling chiefly due to the 
following reasons : 

1. Use of non-random techniques. 

2. Use of unscientific and careless randomization. 

3. Errors of measurement. 

4. Non-response and experimental mortality. 


1.5 Sampling Errors 

Generally data are collected from the sample with the presumption that 
it was representative of the population. However, data collected from a 
sample may not be representative of the population. It is possible due to 
sampling errors and/or measurement errors. Sampling errors can be 
further classified into randomerrors and systematic errors. Measurement 
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t of Bias on Errors of Estimation 


Figure 1.2 Effec 
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errors also havea similar classification. The classification of these errors 
can be better understood with the help of the following diagram : 


Random Constant 


Measurement 


Figure 1.3 Errors of Sampling 


Cell I refers to the errors which occur due to sampling fluctuations, 
and are inherent in the very nature of random sampling. These are 
unavoidable whenever probability is resorted to. However these errors 
tend to cancel each other to the point that if n is sizable the sample statistic 
will get very close to the population parameter. As n increases the size of 
these errors decreases and becomes zero when п = N. It is also possible 
to estimate the magnitude of these errors by using a probability model. 

Cell II indicates sampling errors which do not cancel out but affect 
the result in one particular direction, For example if one were to sample 
the income of university teachers, and select the seniormost teacher from 
each department, the estimate is bound to be in error in the direction of 
a higher value. Similarly if one wishes to sample the monthly sales of the 
shopkeepers in the university market over a year and select the first week 
of every month, one is sure to end up with wrong estimate because 
university employees get paid in the first week of every month and hence 
the sales are the highest then, 
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This type of errors render the data unfit for any generalization. This 
is the worst source of bias in sampling which should be avoided. 

Cell III Sampling is а tool for another ultimate goal, ie. of 
measurement or collection of data or determination of the characteristics 
of a sample. Hence measurement errors get involved in any sampling 
results. Random errors of measurement are due to unreliability of the 
measuring tool. All measuring tools must mismeasure to some extent and 
be unreliable to that extent. However, repeated measurements on a group 
of subjects or increase in the size of the sample can lead to self- 
cancellation of these errors. 

Cell IV indicates constant errors of measurement which cannot be 
eliminated by increasing the size of the sample: these are directional and 
do not cancel each other. If a weighing machine is in error and gives 
weight short by one kilogram, it would introduce a systematic tendency 
on the part of the sample statistic to be higher than what it should be . 
However, careful planning and periodic checking and validation can be 
helpful in avoiding these errors. However, systematic errors are generally 


difficult to detect. 
Error reduction by judicious planning 

A judicious planning can lead to error reduction and save а lot of 
expense by using smaller groups. 

The Lamarkshire milk experiment” in England involved the daily 
feeding of three-fourths of a pint of raw milk to 5000 children and of an 
equal amount of pasteurized milk to another group of 5000 over a period 
of 4 months. These 10, 000 plus a control group of 10,000 were measured 
for height and weightat the beginning and the end of the 4-month period. 
The purpose of the experiment was to check at the relative merits of raw 
vs; pasteurized milk, the control group was non-essential. Despite large 
numbers the feeder and control groups were not comparable as regards 
initial heights and weights. The teachers, ignorant about the adverse 
effects on research, showed a benevolent attitude and gave preference to 
frail and undernourished children for inclusion in the feeder groups. A 
carefully supervised random sampling or a definite pairing procedure 
would have avoided this selective bias. The use of the expensive large 
samples would have been avoided if, according to "Student", the use of 
50 pairs of identical twins had been used. It would have yielded a more 
precise and accurate information at only 2 per cent of the cost of the 


original experiment. 
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1.6 Sampling Designs Classified 
Sampling designs can be broadly classified into two categories : 

Probability Sampling Designs and 

Non-probability Sampling Designs. 

The probability designs are based on random selection as the 
fundamental clement of control and permit the Specification of the 
precision that can be obtained and the size of the sample required for that 
purpose. The non-probability designs are based on the judgement of the 
investigator as the most important element of control. An investigator 
may be instructed to interview 100 persons passing a certain market 
crossing, interview on phone, a specific number of car owners, or 
housewives. The guiding principles in non-probability designs are : the 
availability of the subjects, the personal judgement of the interviewer, 
and the convenience in carrying out a survey. 

However, in non-probability designs, there is the risk of over- 
weightage of the cooperative and the available. Morcover, dependence 
exclusively on the investigator's insight does not lend the design to any 
statistical procedure for the purpose of determination of the margin of 
sampling errors. 

It may be possible in some studies to combine the probability and the 
non-probability procedures but such mixed designs are complicated and 
beyond the comprehension of an average student. 


1.7 The Criteria for Selecting a Sampling Design 

Young" has suggested three criteria which should be kept in mind while 

Sclecting or constructing a sampling design. 

l. A measurable or known probability sampling technique should be 
used so that the risk of errors inthe sample estimate can becontrolled, 
the degree of confidence that can be placed in the published figures 
can be pointed out and whether sufficient resources are available to 
get results from the sample with the reliability required, can be 
determined in advance. 

2. Simple, straightforward and workable methods adapted to available 
facilities and personnel should be used. 

3. Anattemptshould be made toachieve maximum reliability of results 
for each dollar spent. Striking at an optimum balance between 
expenditures and a maximum of reliable information should be the 
guiding principle. 


Fundamental Notions ој Sampling 27 


Generally itis advisable to conduct apilotstudy to uncover potential 
sources of difficulty, to provide the investigating staff with training in 
statistical as well as field work, thus ultimately saving time and expense. 


1.8 Sampling Frames 

A Sampling Frame is generally the list of sampling units from which the 
sample can be selected at each sampling stage. Indexes, maps and other 
population records used for the purpose are also included in this 
definition. Examples of sampling frames are telephone directories, 
directories of street addresses, electoral rolls, list of publishers of books, 
lists of schools and colleges in а state, rating records with the local 
authorities. Sometimes these lists, maps or indexes are in existence and 
can be readily obtained. Sometimes these have to be prepared at an extra 
costbefore sampling can beeffected. Sampling framesare very important 
and influence every aspect of a sampling design — the population 
coverage, the stages of sampling, the stratification used and the process 
of selection itself. 

The construction of a list of sampling units or a frame is often one of 
the major practical problems faced by any surveyor. From bitter 
experience, the samplers have become very critical of the readily 
available frames because good frames are hard to come by. Despite 
assurances from the agency that has constructed the lists, such lists are 
often found to be inadequate, incomplete, or partly illegible, or to contain 
an unknown amount of duplication. The problem of inadequacy ofa 
frame arises when it does not cover the whole of the population to be 
surveyed. А frame may be inadequate for one purpose andquite adequate 
foranother. A frameis incomplete when some of the population members 
whoare supposed to be on itare in fact not on it. Since these elements will 
have no chance of being selected, the sample will be unrepresentative of 
the population to that extent. , 

Another problem from the use of sampling frames arises when the 
sampling units are listed in clusters and not individually. For instance, if 
asample of individual teachers of primary schools in a state 15 to betaken, 
andonlyalistof. primary schoolsand notof teachers isavailable, selection 
by individual units will not be possible. Cluster sampling in which all 
elements in the selected clusters are taken may not always be a good 
procedure especially when clusters are 100 large and contamination of 
responses is possible due to communication among the elements of a 


cluster. 
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, in a survey of 


, each in а different street may 
appear four times at different places in the 


„ and absence of 


him. In case these do not enable him to 
accurately and conveniently, he may 
construct his own, 


1.9 Some Other Important Concepts 
Probability samplingis based on Some important 


Statistical Concepts. The 
most important among them is the law of large 


Numbers, 


Law of Large Numbers 
The law of large numbers states that, 


Beneral statement 
about the size of the difference between 5 andy. 
Itis intuitively obvious thata larger number provides а More precise 
estimate of a parameter 
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unless the estimator is asensible one. Forexample, average salary of 1000 
senior industrialist executives probably would not give a more accurate 
estimate of national average income than the average salary of 100 
randomly selected adult citizens. Hence, the average salary of senior 
industrial executives is not a sensible estimator for the purpose of 
estimating the national average income. Look at other examples. : 

The average salary of 1000 University Professors is not a good 

estimatoras compared to theaverage salary of100randomly selected 

teachers (including professors) for estimating the average salary of 

a teacher in a country. 

The average salary of 1000 bank managers is not a good estimator 

as compared to the average salary of 100 randomly selected bank 

employees (including managers) to estimate the average salary ofa 
bank employee. 

The reader may think of more examples which can be picked up from 
various fields and subjects : If the smallest or the largest value is taken, 
it is not a sensible estimator. 

Another concept, important to sampling isthe conceptofan unbiased 
estimator. If an estimator satisfies the condition that the average of its 
sampling distribution shall be equal to the true parameter value, it can be 


said to be an unbiased estimator. 


Central Limit Theorem and the Normal Distribution 
It is presumed that the researcher is aware of the concept of sampling 


distribution. However, for refreshing his knowledge we may define 
sampling distribution as the distribution of a statistic (like mean or 
variance) generated by calculating a large number of these randomly 
selected samples from the same population. CUN. 
Tis anastounding fact that notonly a articularsampling distribution 
approaches some regular form as sample size increases, but that it is 
alwaysthesame form, the so-called normal distribution. Thecentral limit 
theorem is a complete generality of this mathematical result that the form 
of the population from which we are sampling makes no difference 


provided that - 
(i) The more irregular 


the sample size we require. | 
(i) ^ Random sampling is undertaken for the purpose of inference. 


Random sampling tends to remove biases and enables us to estimate 
unbiasedly. It allows for the estimation of precision of our estimator from 


the distribution in the population, the larger 
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the same sample. 


The central limit results apply not only to sample average, but to 
practically all the estimators (like variance). 


1.10 A Generalized Model of Sampling 

Sampling has the important Objective of helping the researcher in 
producing good research, However, there isa constant interplay between 
the objectives and Strategies of research adopted and the strategy of 
sampling to be utilized, Very often the former gets an upper hand. In the 
Process of sampling, many decisions are lo be takenatseveral stages. The 
followingdiagram outlines the process of these decisions, while selecting 


research design can be selected 

Three importantconstraints which are faced by an investigator while 
selecting a sampling Strategy аге: 

1. Objectives of the study. 

2. Type of the Study. 

3. Resources available for the Study. 

If the objective of research i 
of the study toa small local gro 
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SAMPLING STRATEGY 


Objectives 


Tar get Population 
and Sampling Unit 


Resources Type of study 


Equipment 


Sampling Technique 


Judgement 
Sampling 


Sampling Error 
Estimate not possible 


Fig 1.4 A Generalized Model of Sampling 
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Important New Ideas 
SAMPLE ARTIFICIAL POPULATION 
SAMPLING FRAME HYPOTHETICAL POPULATION 
POPULATION SAMPLE SIZE 
TARGET POPULATION BIAS 
EXPERIMENTALLY SAMPLING ERRORS 
ACCESSIBLE POPULATION | MEASUREMENT ERRORS 
FINITE POPULATION PROBABILITY SAMPLING 
LAW OFLARGE NUMBERS  NON-PROBABILITY SAMPLING 


NORMAL DISTRIBUTION 
RANDOM SELECTION 


ESTIMATOR 
CENTRAL LIMIT THEOREM 


] 2 


SIMPLE RANDOM SAMPLING 


2.1 The Technique 
SAMPLE studies deal with 5 
populations which contain а 
all be distinguished from on 
Size n that can be drawn fro 
formula :! 


amples drawn from finite populations or 
finite number, N, of units. If these units can 
е another, the number of distinct samples of 
m the N units is given by the combinatorial 


N! 
GC) Gs „Сп агуу – 


Ш 


For illustration, if a Population contain 


5 4 units denoted byA,B,C 
and D, there are six di 


fferent Samples of size 2, as below :- 


AB AC AD BC BD CD 

In fixing these combinations, the same letter is not Tepeated in any 
Sample and the order of the occurrence of the letter has been ignored. For 
example, AB and BA are identical and 50 are AC and CA, and AD and 
DA. 
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n" Definition of the population | 


| 


| Preparation of Sampling Frame 
(in individual units ) 


| Decision About Size of sample | 


Random Selection Strategies 


Random Computer 


Number Pseudo Number 
Tables Random Draw 


Figure 2.1 The Process of Simple Random Selection. 
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individual units and not of clusters or groups. For example, if the 
experimental unit is a student, list up all the individual students and not 
simply classrooms or schools. If the study is to be conducted on 
households, list up all the households and not blocks or sectors. Generally 
the researcher need not be concerned as to how or in what order the 
populationelementshave been arrangedin the sampling frame. The order 
— high magnitude to low magnitude or reverse; first come first listed; — 

will not make any difference as the Tepresentativeness of the sample will 
be taken care of by the randomization process. A decision about the size 
of the sample should be based on the considerations already mentioned 
in Chapter 1. However, samples on human beings should be fairly large 
50 as to ensure reliability of results. A sample of less than 30 units based 


on random selection will generally be dubious in terms of its 
representativeness. 


» at the randomness 


However, there are some Serious objections to these procedures. The 
objects may not be such as could be mixed. The elements will have to be 


numbered. The numbers will then be mixed and drawn. The objects or 
elements] -aming those iden 


nts. tification numbers will then be picked up for 
inclusion in the sample. 
Another objection is 
perfect balls, perfect discs 
‘chirdly, the condition 


that it is difficult to constr 


uct perfect coins, 
» and perfect urns, 


Performed by careful expe 
random numbers - 
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involving n's running into thousands, and having large sampling frames, 
the use of random numbers or manual lot-drawing will be too 
cumbersome to recommend. In such situations, computer generated 
random selection should be resorted to, to cut down on the time and labour 
involved in other methods. 


2.2 The Tables of Random Numbers 

Tables of random numbers have been prepared by Kendall and Smith? 
Fisherand Yates, and Tippett,‘and constitutea very convenientand most 
objective method of random selection. From the members of population 
already numbered from 1 to N, the required number of units are selected 
from one of these tables in any convenient and systematic way. For tables 
of random numbers and guidelines about their use see Appendix. These 
tables are so prepared that all the ten numbers from 0-9 have an equal 
chance of being selected. If we examine these tables, we will see that 
whether we go down a column or across à row, there is no discernible 
pattern. The numbers are computer generated andare truly random. If we 
count up the number of O’s,1’s, 2’setc., from any of these tables, one will 
notice that their number is approximately equal. Students may gencrate 
their own tables of random digits by using ordinary lottery method on the 
ten digits - 010 9 -and listing up the numerals drawn each time. However, 
this is unnecessary keeping in view the fact that these tables are readily 
available in any book on statistics on sampling theory. 


2.3 Sampling Without Replacement , 

The method and formula given in the beginning of this section pertains 
to sampling without replacement where a unit drawn is not put back or 
replaced in the population before another unit is drawn. Thus the size of 
the population doesnot remain the same with every unit drawn. The table 


given below makes it amply clear : 
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Table - 2.1 


Relationship of the Element Position to Number of Choices and 
Probability in Sampling without Replacement 


Position of Number of Probability 
element in the choices 

sample (possible) 

Ist N 1/N 

2nd N-1 1/(N-1) 
3rd N-2 1KN-2) 
(n-1)th N-n+2 1(N-n+2) 
nth N-n+1 1/(N-n+1) 


assumption that all Ке 


n samples have an equal p 
to verify also. 


robability and it is easy 


2.4 Sampling with Replacement 
Insampling with re 
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AAA BBB ccc DDD 
AAB AAC AAD ABB 
BBC BBD ACC BCC 
CCD ADD BDD CDD 
ABC ABD ACD BCD 


Thus by enumeration we arrived at 20 possible groups with no two 
groups being identical. The number of possible groups of size n from a 
population of N units, using the method of sampling with replacement is 
given by the formula : 


(N+n-1)! 


n!(N-1)! 
In the above problem the number of possible groups as given by the 
formula is : 


(443-1)! 6! 720 


= 20 


31(4-1)! 3! x 3! 6x6 
(checks with those enumerated above) 
Sampling with replacement is entirely feasible but except in special 
Circumstances, is seldom used. There seems little justification in having 


the same unit twice in the sample. e : 

In sampling with replacement the number of possible choices and the 
Probability of the selection ofeach item, regardless of whatelements have 
been drawn previously, remain the same for each item. The table given 


below shows this relationship. 
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Table - 2.2 


Relationship of the Element Position to Number of Choices 
and Probability in Sampling with Replacement, N- 100 and n = 10 


Position of Number of Probability 

element in the choices 

sample (possible) 
— MÀ Qe 

Ist 10 1/10 

2nd 100 1/100 

3rd 100 1/100 

9th 100 1/100 

10th 100 1/100 


The method of random selection 
Conveniently used and serious practical d 
in the following types of situations: 

(1) When the Population is very large; 
(i) When the population is not finite, the method becomes 


With replacement cannot be 
ifficulties may be experienced 


(iii) When the nature of units is such that once a unitis drawn it dies 


back in the aggregate; 
(v) Whensamplesof mutually exclusi 


a population of 1000 in М*= 1000? = 1000, 
replacement; and in (N), = 1000 x 999 = 


without replacement. The latter figure is only 0.1 рег cent smaller than 
the former. 


Before concluding this introduction to sampling with replacement, 
three additional remarks can be offered. Firs; 


tly, when Sampling with 
replacement, it is possible for the sample si 


ге to be greater than the 
population size and still not have observ 


ed every unit in the finite 
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population. This cannot occur when sampling without replacement. 
Secondly, the variance of the mean when sampling with replacement is 
already larger than when sampling without replacement. Thirdly, in the 
more mathematical presentations of statistical methods, discussion of 
sampling with replacementis much more common than sampling without 
replacement. There are at least two reasons for this. One is that it is 
popular toassumethatthe sampled population isinfinite in size. The other 
is that sampling with replacement is associated with more tractable 


mathematics. 


2.5 Terminology Used 

In the preceding pages a clear distinction between the terms random 
sampling without replacement and random sampling with replacement 
has been made. However, due to the emergence in literature of no 
standard terminology to distinguish the two, a looseness in their use has 
been noticed. Kish uses “Simple random sampling” for the former and 
“Unrestricted random sampling” for the latter. Cochran’ also uses 
“simple random sampling” for sampling without replacement, but 
suggests that "unrestricted random sampling"isjustan alternative phrase 
for “simple random sampling”. Yule and Kendall® use "simple sampling” 
to mean sampling with replacement. In the present work the author has 


also followed Kish. 


2.6 Concluding Remarks х | 
The foregoing discussion would remove the layman's misconceptions of 


random selection as something haphazard, careless, unplanned, hit and 
miss and something involving exercise of no conscious selection on the 
part of the investigator. Such ideas are far from correct. However, simple 
random sampling is the simplest probability design calling for no special 
expertise and training or even insight. It can be used mechanically by 
anybody who has all the population elements listed and a set of tables of 


random numbers. / 
At the same time, it allows for the control of sampling error. Buta 


randomly chosen sample may look most *unrandom". If a sample, by 


chance, contains no woman although there may be a fair proportion of 


women inthe population, the sample isunrepresentativeof the population 
andhencecannotreflectitstrue characteristics. Butsuch “extraordinarily 
unlikely” events may Occur, such a sample being close to the tails of the 
sampling distribution. If the investigator goes on taking large number of 
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such samples in precisely the same way, on the average, the estimates of 
the characteristics derived from them all would correspond to the true 
characteristics in the population. 

The unrepresentativeness of a single sample does not throw doubt on 
the randomness of the sampling procedure. In cases where геоссштепсе 
of such "unrepresentative looking" samples is detected, the investigator 
will have grounds for Suspicion and will be well advised to review the 
Procedures of sampling used by him. It may, however, be understood that 
the randomness of the Process can be gauged by Studying the results of 
Tepeated samples, not by the appearance of a single one. 

Before concluding the discussion on simple random method, it 


would be appropriate to mention the precautions necessary to avoid 
inadvertent departures from randomness : 


1. The definition of the Population and of the observations should 
be precise and coinciding with each other. 


2. The definition or list of all the Population elements should be 
complete. 


3. Themechanical procedure of 


drawing the sam pleshouldbe easy 
to carry out and should not 


allow any biases to enter.” 
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Important New Ideas 


SIMPLE RANDOM SAMPLING 
SAMPLING WITH REPLACEMENT 
SAMPLING WITHOUT REPLACEMENT 
UNRESTRICTED RANDOM SAMPLING 
EXPERIMENTAL UNIT 

RANDOM NUMBERS 


3 


SYSTEMATIC SAMPLING 


Thusa variation of the random proce. 
selection of the required number of el 
included in th 
l. Lis Up the population 


2. Determine the desired sa 
and also K = N = 1000 


n 109 10. 


»the technique is also 
known as Sampling by Regular intervals”, “Sampling by Fixed 
Sampling by Every kth Unit » 

3.2 Systematic Compared with Simple Random 

The procedure of systematic Selection is easier and more convenient than 
the simple random sampling, Itprovides amoreeven Spread of the sample 
Over the population list and hence Jeads to greater precision, But, can we 
Consider systematic Sampling to be equivalent to simple random 


Systematic Sampling 45 


sampling? Strictly speaking, systematic sampling is not equivalent to 
simple random sampling except in cases where the population list is ina 
random order. But generally ordinary lists are not so. Order the liston any 
basis, or get hold of any list already prepared, there is invariably some 
systematic arrangement. Furthermore, the choosing of the first number 
randomly docs not make it so. It docs not provide all the possible samples 
of size n an cqual chance of being selected. Morcover, the random 
selection of the starting points determines the other units of the sample, 
hence the whole sample. 

This dependence or linkage of one member of the sample on the 
previous one, makes the process different from simple random method, 
in which selection of every member is independent of the other. 

Moser and Каноп! have used the term Quasi-random sampling for 
systematic samples selected from lists arranged more or less at random 
or when the feature, on the basis of which itis arranged, is not related to 
the subject of the survey. The selection at regular intervals, from such а 
list, can be considered as approximately equivalent to simple random 
sampling. 

Cochran? mentions the advantages of systematic sampling over 
simple random sampling in the following words : 

1. Itis casy to draw a sample and often easier to execute without 
mistakes. This is a particular advantage when the drawing is done 

in the ficld. Even when drawing is done in an office there may be a 

substantial saving in time. For instance, if units are described on 

cards that arc all of the same size and lic in a file drawer, a card can 
be drawn out every inch along the file as measured by a ruler. This 
procedure is speedy, where as simple random sampling would be 
slow. Of course, this method departs slightly from the strict “ every 


kth” rule. 


2. Intuitively, systematic sampling seems likely to be more precise than 


simple random sampling. In effect it stratifies the population into n 
strata, which consist of the first k units, the second kunits and so on. 
We might, thercfore, expect the systematic sample to be about as 
precise as the corresponding stratified random sample with one unit 
per stratum. The difference is that with the systemauc sample the 
lative position in the stratum, whereas with 


units occur at the same ге! onar à э 
the stratified random sample the position in the stratum is determined 
separately by randomization within each stratum. The systematic 


sample is spread more evenly over the population, and this fact 
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sometimes made systematic sampling considerably more precise 
than stratified random sampling. 
The relative position of units selected through systematic sampling 


and those through stratified random sampling is shown in the diagram 
given below : 


x =systematic sample o =stratified random sample 


e dee M al о los 
3 4 


5k 6k 
Unit number 


Figure 3.1 Relative Position of units selected through 
systematic and stratified random sampling 
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The figure given above shows that the relative position of a unit 
selected from each k through systematic sampling remains the same. But 
the relative position of a unit selected from each of the strata through 
stratified random sampling does not remain the same except in a few 
strata out of a large number of them. In the latter type of sampling, the 
relative position of a unit in each stratum is determined separately by 
randomization within the stratum. 

Allthings being equal, simple random sampling is to be preferred for 
accuracy over systematic sampling. That is systematic sampling may be 
viewed as a practical approximation to random sampling. While random 
sampling has the advantage of being accurate (if done properly) and does 
not require the assumption of a randomized sampling frame, systematic 
sampling is more practical in that it is less work and thus provides more 
information per dollar. Also, because itis simpler to perform (especially 
for inexperienced survey workers), it may reduce error. In general the 
more complex the method, the greater the opportunity for error. * 


33 Periodic Effect 
There isone risk. When the list hasa periodic arrangement andthe sample 
interval coincides with the periodic interval or multiple of it, а systematic 
sample will yield poor results. From alistof married couples listed in pairs 
with the husband first, a systematic sample, with k = 2, and 1 as the 
randomly selected first member, will comprise all husbands only. If the 
starting number is 2, all wives with the exclusion of husbands will be 
selected. P i 
Some populations have akind of periodic effect which is implicitand 
notsoobvious. A monthly pay-roll of teachers ina school may always list 
the teachers in the same order. If only one name is to be drawn over à 
number of months, the sample may consist mainly of the salary figures 
of one teacher or two or three teachers. In such cases a simple or stratified 
random sampling is preferable. Madow‘, Milne*and Finney*have shown 
that in some natural populations, some quasi-periodic variation may be 
present that would be difficult to anticipate. The effect of quasi-periodic- 
ity is that systematic sampling performs poorly at some values of n and 
particularly well at others. Sometimes the process of measurement mày 
introduce this effect. 
Populations with тог 
uncommon to find. Students' atten 
Six working days of the week, flow of ro 


е or less definite periodic trend are not so 
dance at a residential university over 
ad traffic past a particular point 
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on a road over 24 hours, sales over four weeks of the month of a grocery 
Store having a majority of monthly paid salaried customers, and the 
number of hours the students spend in study during different terms of an 
academic year in a system depending on promotions on the basis of one 
final examination towards the end will show periodic trend. 

According to Cochran Periodic trend ma 


y take the shape of a sine 
curve" as shown below : 


Figure 32 Sine Curve Showing periodic effect 


* A sine curve is a curve which ch i i " d 
periodically. anges in magnitude continuously and in direction 
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In such situations effectiveness of the systematic sample depends on 
the value of K. In Figure 3.2 above the height of the curve is the 
Observation yi. The sample points A represent the case least favourable 
to the systematic sample. This case holds whenever K is equal to the 
period of the sine curve or is an integral multiple of the period. Every 
Observation within the systematic sample is exactly the same, so that the 
sample is no more precise than a single observation taken at random from 
the population. The most favourable case is B. It occurs when K is an odd 
multiple of the half period. Every systematic sample, in this case, has a 
mean exactly equal to the true population mean, since successive 
deviations above and below the middle line cancel. Between these two 
cases the sample has various degrees of effectiveness depending on the 
relation between k and the wavelength. : 

Hence the performance of systematic sampling in comparison with 
that of simple random or stratified random is to a great exient dependent 


Оп the properties of the population. 
Another limitation of the systematic sampling emerges from the fact 


that very often nisnotan integral multipleofk. This leads to samples from 
the same finite population varying by one unit in size. With N= 33,k= 
5, the size of the various possible samples would be as under : 


Table - 3.1 


The Possible Systematic Samples with 
Their Size (N=33, k=5) 


Sample Starting Number Numbers Chosen Size 
i 1 П б ail 16; 21020 3107 
ii 2 2 97. 19.44. 22.27; Вот 
iii 3 3.198. ЗЕ 23:523 2:35.97 
iv 4 4 9 14, WO) 24, 29, 6 
M 5 58 10. 15 20 25; 30 6 


The first three samples have п=7, while the last two, n-6. This fact 
introduces adisturbance into the theory of systematic sampling especially 
Whennissmall. However, in samples of size 50 and more, this disturbance 


тау be negligible. 


50 


Better Sampling : Concepts, Techniques and Evaluation 


A variation of the systematic sampling is to choose each unit at or 


near the centre of the stratum. Thus, instead of following the popular 
method of starting the sequence by a random number chosen betiveen 1 
and k, the sampler takes the starting number as (k+1)/2 if k is odd and 
cither k/2 or (k+2)/2 if k is even. Logically this procedure is defensible 
because in cases of continuous variables, there are grounds for expecting 
the centrally located sample to be more precise than the one randomly 


located. However no research evidence has been brought to bear upon the 
efficacy of centrally located samples. 


3.4 Some Practical Hints 


о = 


Be 


t3 


Use a random start. · 

In casc ofa very large sample, change the random start several times 
to avoid trends or periodic fluctuations. 

Shuffle the list thoroughly. 


As far as possible have a sam 


pling frame already arranged 
randomly, 


These precautions will introduce the element of randomization in 
sampling based on systematic selection, 
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Important New Ideas 


SYSTEMATIC SAMPLING QUASI PERIODICITY 


PERIODIC EFFECT SAMPLING BY FIXED INTERVALS 
SAMPLING BY REGULAR SAMPLING BY EVERY Kth UNIT 
INTERVALS SINE CURVE 


4 


‘STRATIFIED SAMPLING 


4.1 The Technique 

IN А PREVIOUS chapter simple random sampling technique was explained 
as the one basic to many other sampling techniques. It was also pointed 
out that the precision of a simple random sample can be raised by 
increasing its size. However, itis not the only way. A very popular method 
to increase the precision is stratification of the main population into a 
number of sub-populations each of which is homogeneous with respect 
to one or more characteristics and then to select randoml: 
number of cases from each sub-population. 

In symbolic terms the population of N units is divided into sub- 
populationsof N,, N,N,...N, ,unitsrespectively. Thesub-populationsare 
called strata. These sub-populations are non-overlapping and together 
comprise the whole of population so that 

NENEN F. EN SN 

A sample is then drawn from each sub-population independently of 
other sub-populations. This may involve random, systematic ог any other 
mode of selection. If sampling from each stratum is done randomly the 
method can be designated as stratified random sampling". Ifsystematic . 
sampling is used, it may be called "stratified Systematic sampling." The 
latter is not equivalent to the former. At the most, it can be called 
“stratified Quasi-random Sampling". However, stratification. does 


improve the efficiency of both the random selection and the systematic 
selection. 


y the required 


The steps involved in the stratified random sampling are enumerated 
below: 
1. 


Decide upon the relevant stratification factors as Sex, residence, age, 
courses of studies etc. 


Divide the entire population in sub-populations based on the 
Stratification criteria, 


2. 
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3. List up the units separately in each sub-population. 

4. Select the requisite number of units from each sub-population by 
using an appropriate random selection technique. 

5. All the sub-samples thus selected make up the main sample. 


4.2 Stratification Criteria 

There are various factors on which stratification is often done. Selection 
of these factors depends upon the nature of the study, the various 
dimensions included and the nature of the population to be used for the 
purpose. The factors commonly used for the purpose are : Sex, age, 
income, heights, weights, educational status, I.Q., residence, volume of 
business, number of employees, strength of students, courses of studies, 
grade, caste, cultural level, religion and the like. 

However, there are three essential requirements for the selection of 
criteria of stratification. Firstly, the criteria for division of population into 
strata should be correlated with the variable being studies. In a survey of 
achievement in Mathematics of a particular age group of students, the 
colour of the eyes - black, brown, blue etc. - or the weights of the students 
cannot be considered to be appropriate criteria for stratification because 
these variables do not have any known relationship with achievement in 
Mathematics. However, the educational level and the economic level of 
the parents may become logical bases of stratification. Secondly, the 
criteria should be practical. These should not provide so many sub- 
samples as to increase the size of the required sample over that required 
by simple random technique. For example, ina study of fertility rates in 
the population ofasmall town with population around ten thousand, race, 
religion, education, socio-economic-status andagehave been considered 
to be appropriate criteria for stratification. Furthermore, if there are three 
categories of race, two of religion, three of education, three of socio- 
economic status and four of age, the number of strata would equal 
3x3x3x3x4 or 324. Since a statistically satisfactory number in the 
smallest cell could not be less than ten cases, the minimum number of 
Cases thus required would be 3,240 which will be considered enormous 
for the small town. Hence the precision gained through stratification will 
be neutralized by higher costs of data collection. However, this 
Observation applies to surveys in which only overall estimates are to be 


made. If estimates are wanted also for all the aa id de 
Populati largernumberof stratais stronger. Thirdly, 
Re paced эу ОТ teria should be available. If no 


а good measure of the stratification cri 
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reliable and valid tool of assessing socio-economic status was available 
or categorization into distinct races was not clearly feasible these criteria 
would lead to confounding of the results. 

To solve the dilemma between the number of criteria and the 
unmanageable increase in the sample size required, it may be assumed, 
on sound grounds, that many such variables occur as associated factors. 
In this way, socio-economic status may be chosen to stand also for 
education and in some cases for religion. In case, the community is fairly 
homogeneous on race, this criterion can also Бе kept out in stratification. 

When sampling is stratified the Standard Error formula for the mean 


differs from the Standard Error formula in simple random sampling. The 
SE,, in this case will be : 


2.02 
SETS u^ SAUCE in which 


N 
с = Standard deviation of the entire sample 
б, = Standard deviation of the various strata means around 


the mean of the entire sample. 
The standard deviation of the means of the Strata around the mean of 


the entire group can be calculated through the following formula which 
is just an extension of the usual SD formula : 


c lx AN, (M,-M)? + ммм) +. (мум) 
кашне меша 
N 


where №, N,,..N, stand for the number of case 
M denote the size and mean of the whole sai 


sin strata 1 tok; and N and 
трѓе. 


4.3 The Problem of Proportionality 
When thestrata have alread 


into sub-populations, the Question arises 
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or other reliable sources of descriptive data. 

For the purpose of illustration, suppose a survey of general 
intelligence of school-going children of the age group 14+ to 15+ in à 
district is to be carried out. The relevant stratification criteria are -sex and 
residence (Urban, Suburban and Rural). The school census data of the 
district show the following percentages of each category in the population 
of 14+ to 15+ school going children : 


Population (All 14+ to 15+ school-going children) 


Boys 60% Girls 40% 


| | | | | 
Urban Suburban Rural Urban Suburban Rural 
20% 25% 55% 60% 25% 15% 


Suppose further that a sample of 1000 has been considered to be 
adequate for the purpose. The distribution of various strata in the final 
Sample will then be as follows :- 


Urban Boys 120 Urban Girls 240 
Suburban Boys 150 Suburban Girls 100 
Rural Boys 330 Rural Girls | 6 
Total Boys 600 Total Girls 400 

„Эйе зерек =й ee 


Fig. 4.1 Proportional Stratified Sampling 


the stratified proportional random sampling requires 


In this way, 1 
mid h stratum in proportion to the actual 


Selection of units at random from eac 
Size of the group in the total population. 

Stratification, if done wisely, improve 
Sample. The proportional weighting of eac 
and an investigator may use а smaller sampl 
efficiency at reduced costs. 


s the representativeness of the 
h criterion improves it further 
е and achieve higher 
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4.4 The Optimum Allocation 


Thereare circumstances when disproportionate numbers are drawn from 
various strata. In cases where a special interest is attached to either the 
difference between two or more particular strata or upon intensive 
analysis of one stratum, disproportionate sampling may be used. The 
results are proportionately reduced at the time of tabulation of data for the 
whole sample. Situations may arise in which some sub-populations are 
much more variable than others. Itmay be due to the fact that the members 
in those sub-populations are less mixed or variable with respect to the 
characteristic or attribute of interest and hence are more difficult to 
represent by a sample of given size. Hence for the purpose of increasing 
overall precision, a larger sampling fraction may be taken from such a 
stratum. 

Another important consideration is the cost per sampling unit. If the 
cost per sampling unit (travelling costs and measurement costs) in a 
particular stratum is greater than in others, asmaller fraction may be taken 
in costlicr stratum. These considerations giverise to the idca and concept 
of optimum allocation in various strata. For further study of the subject 
the reader is referred to Kish,! Yates,? Neyman; Cochran‘ and Stuart’, 
who have given various theorems and their proofs. 

However, for the purpose of 
principles concerning disproporti 
in view : 

In a given stratum, take a larger sample if : 

(i) The stratum is larger, 
(ii) The stratum is more variable internally, and 
(iii) Sampling is cheaper in the stratum. 


general understanding the following 
onate stratified sampling may be kept 


However, there are some problems in the Practicability of optimum 
allocation. The investigator generally does not know, before hand, the 
relative variability and the relative costs in the strata, Moser and Kalton® 
suggest the following ways of Overcoming such limitations : 


(i) Obtain guidance from the Previous surveys using the same or similar 
populations, 


(ii) Ascertain the estimates of standard deviation and cost through a pilot 
survey, 


(iii) If the strata standard. deviations are difficult 


1 to estimate directly, 
relate the sampling fractions to some other 


measurement (strata 
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means) which is itself known orexpected го berelated to the standard 

deviations, and 
(iv) Use expert judgement. 

However, all these methods produce approximations rather than 
exact figures, but these may be sufficient, because small departures from 
the optimum allocation lead to only a slight loss of precision. Moser and 
Kalton? further state, “If the sampling fractions are close to the ‘best 
fractions’, there is still likely to be as much of gain as if the best fractions 
had been used.” 

Another issue in stratification that needs attention is the problem of 
optimum allocation in two-way stratification. with small samples. The 
problem becomes all the more difficult if the size of the sample (n) is less 
than the product of the levels of the two strata. For instance, if there are 
five levels of income and three levels of residence, the question of 
selecting a sample of size 10 will pose certain problems. Bryant, Hartley 
and Jessen? have developed a simple technique which requires that the 
size of sample exceeds the greater of the number of levels of the two 
criteria. In the case mentioned above, n=10 exceeds the number of levels 
of income i.e. 4, thus permitting the use of the technique. The method 
involves arranging of the population in a two-way stratification table and 
allows approximately equal chance of selection to each unit, at the same 
time giving each marginal class its proportional representation. For 


further details of the technique consult Cochran.’ 
Goodman and Kish’? have suggested another method called 


“controlled selection" which can be used efficiently even if a substantial 
number ofcells are empty. In this method, therowsrepresent the principal 
Stratification, one unit being drawn from each TOW. They have 
demonstrated the technique which involves finding a limited number of 
acceptable allocations, cach with its appropriate probability, such that 


cells are selected with equal probabilities. 


4.5 Stratification after Selection 
In some studies stratification may not be possible before the data have 


been collected, The stratum to which a unit belongs may not be known 
until the investigator has actually gone in the field to conduct the survey. 
Personal characteristics such as Sex, Гасе, educational level, age are 
examples of such stratification criteria. The procedure involves ои of 
asimple random sample of the required size and then classifying the units 
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into various strata. The method is quite efficient provided the sample is 
reasonably large, i.e. greater than 20 in every stratum and the effects of 
errors in the weights of strata can be ignored. 
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Important New Ideas 


STRATIFICATION STRATIFIED SYSTEMATIC 
SAMPLING 

PROPORTIONAL WEIGHTING STRATIFIED QUASI-RANDOM 
SAMPLING 


OPTIMUM ALLOCATION STRATIFICATION CRITERIA 
SUB-POPULATIONS CONTROLLED SELECTION 


5 


CLUSTER, MULT. I-PHASE AND 
REPLICATED SAMPLING TECHNI QUES 


5.1 Cluster Sampling 
CLUSTER samplin g involves d 


cluster is an intact Broup as available in the field. It j 
researcher for data collection. 


divided into 600 blocks of 50 houses each, a Simple course would be to 
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select at random six blocks, i.e. 1 per cent and include all the households 
in them in the sample. 

In this way, the sample instead of being scattered over all the blocks 
will be confined to only six blocks. Suppose the information were to be 
collected by direct interviewing it would be more economical to contact 
the households because of the proximity of the same in six compact areas. 
In addition to this the advantage in the use ofa cluster and not a household 
as sampling unit receives added importance in cases where an adequate 
listing of all the units of investigation in the population was not readily 
available. Listing of thése units through direct contact would be highly 
expensive and time-consuming. Cluster sampling has been described in 
social surveys as “area sampling" and “area probability sampling". The 
latter term is somewhat misleading because simple random sampling can 
be used with areal units. Moreover, the area probability sampling is based 
to some extent on the principle that areal probability can be used for 


sampling purposes. | 
5.2 Some Other Cluster Sampling Designs 

The examples given in the preceding pages illustrate the simplest method 
ofdrawingacluster sample. Thereare many alternative waysofobtaining 
acluster sample which involve more complex systems such as two-stage, 
three-stage and multi-stage sampling designs. The design may call for 
Starting with larger units, just as provinces in a country; obtaining an 
initial sample of them; from the provinces so selected, choosing a specific 
number of districts; from the chosen districs, selecting a number of sub- 
division; and from the sub-divisions drawing a sample of farms, stores ог 
households with equal probabilities of inclusion or with varying 
probabilities. It may also involve special procedures for sampling stores, 
hotels, farms, institutions and other special groups of the population. 


5.3 Multi-phase Sampling 

Multi-phase sampling involves 
Whole sample and additional in 
from sub—samplesofthe fullsamp 
survey ofa large sample, an investigator m 


for a more comprchensive investigation. 
For example, an investigator plans 10 conduct a household 


expenditure survey. The information to be collected is decided upon. 


collection of some information from the 
formation either at the same time or later 
le. After makingasimple inexpensive 
ay select sub-samples from it 
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A sample of households is selected but each household is not questioned 
onall the matters being covered. Only the basic data-size and composition 
of the household, occupation of the head, income etc., are collected. 
Information regarding other important aspects such as distribution of 
expenditure over the main items may also be sought from the entire 
sample, because the analyses of resultsare to be basedon thisinformation. 
However, there may be some less important matters on which the 
investigator does not require detailed analyses of such high precision. 
Seeking less important type of information from the entire sample may 


during the first phase May be us 
selection of the sub-sample. 
Post-stratification ratio and Te 
also be used to improve the preci 
effects of non-response in th 


gression estimation techniques can 
sion of the sub-sample results, The 
€ sub-sample can be estimated, 
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unrepresentativeness of the remaining sample can be gauged on the basis 
of the basic data, and if necessary, re-weighting can be done to counteract 
the effects of unrepresentativeness. 

However, it is important to know that the use of two-phase sampling 
for the sole purpose of increasing the precision of sub-sample results is 
effective only if the cost of data collection is considerably lower for 
members of the first phase sample than those of the sub-sample. Moser 
and Kalton! suggest that if two-phase sampling is to be useful, the gain 
in precision resulting from the use of first phase data to improve the sub- 
sample results must outweigh the loss in precision resuliing from the 
reduction in sub-sample size. This is likely to occur only if the cost per 
individual of collecting the data at the first phase is cheaper than that at 
the second phase by a factor of at least, say ten. This degree of variation 
in costs сап occur, for example, when the first phase information is taken 
from some form of records or collected by mailed questionnaires while 
the second phase information is obtained in personal interviews. 

Two phase sampling may be useful in studying rare populations such 
as those of persons with rare diseases. The first phase may help in 
Preliminary securing through an inexpensive method a large sample to 
detect the positive and negative cases and in the second phase the sub- 
sample for the purpose of more expensive and thorough examination. 

Cartwright ina study of “Human Relations and Hospital Care” used 
the technique of two-phase sampling to obtain a sample of persons who 
had been in hospital in a set period shortly before the survey was 
undertaken. The first phase sample consisted of 29,400 persons selected 
from the Registerof Electors. Through abrief postal questionnaire, itwas 
ascertained whether they had been in hospital during the period in which 
the investigator was interested. During the second phase, all the 1119 


Persons who had replied in the affirmative were interviewed. During the 


interview, 15 per cent of this sub-sample were found not to have been in 


hospital during the specific period, and hence their views about the 
hospital service were not sought. However, the views of the 100 per cent 
Sub-sample taken from the positive stratum were recorded. А 
The Population Investigation Committee and the Scottish Council 
for Research in Education? conducted a survey of intelligence. A first 
phase sample of 80,000 children was used for group intelligence tests and 
for the main questionnaire. А more detailed questionnaire was then 
addressed to а sub-sample of children bom on the first three days of each 
month. A further sub-sample comprising those born on the first day of 
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each alternative month was used for individual intelligence tests. 
Multi-phase sampling is a popular technique in population censuses 
in various countries. In the 1961 Census in Britain, certain data - basic 
items like sex, age, marital condition and also items like number of living 
rooms, sinks, bathsetc., were collected for the whole population and other 
data - e.g. occupation, place of work, age at which full time education 


ceased, qualifications in science or technology - were asked of only a 10 
per cent sample. 


5.4 Replicated Sampling 

Deming* has discussed, in detail and with numerous illustrations, the 
technique of replicated or interpenetrating sampling. 1n the use of this 
technique a number of Sub-samples rather than one sample are sclected 
from the population. Each of the samples is based on a uniform sampling 
design, is self-contained, and adequate. For the purpose of selecting these 
Sub-samples any basic sampling design with stratified or non-stratified, 
single or multi-stage, single or multi-phase approach can be used. The 
essential pre-conditions to be fulfilled include the independence of each 
Sub-sample from the other, the uniformity of the sampling design and 
complete coverage of the population in each trial of selecting the sample. 


For example, Suppose a survey of level of Mathematical 
achievement of 8th class male stud 


Schools in a district is to be с 
Mathematicsis to be administere 
1000 students. The fullsamplec 


each, or five sub-samples of 200 each, or ten sub-samples of 100 each, or 


ub-samples is required. 


Ted to the sub-samples, 
le wise. The sub-sample 


x Ifan overall sample is selected by using k independent sub-samples, 
Zi is the estimate for the ith Sub-sample. The overall estimate will then 
be given by the mean of those estimates : i 


Z= EZ 


Cluster, Multi-Phase and Replicated Sampling Techniques 65 


The standard error can be calculated by first calculating the standard 
deviation of the sub-sample estimates : ј 


Sy xat (2, е: zy 


The standard error would then be : 


SEQ = / 5, 
W ak 

It may, however, be noted that these formulas are the same as used 
in the calculation of mean, standard deviation and standard error of the 
mean while using ungrouped raw scores and students of research who 
have done an elementary course in applied statistical, techniques are 
expected to be conversant with them. The standard error formula given 
above is applicable to all forms of sampling procedures employed in 

drawing the sub-samples. 


Merits and Limitations 

One of the merits of replicated sampling technique is the ease of standard 
error calculation as compared to more complex designs e.g., multi-stage 
stratified sampling. Replicated sampling technique can be supported on 
two more counts. Sometimes when the total sample is too large to permit 
the survey results to be ready in time, one or more of the sub-samples or 
replications can be used to obtain advance results. It may be a practical 
consideration sometimes. 

Another important merit concerns with the possibility of throwing 
light on variable non-sampling errors such as those due to interviewer 
variability in studies where interview was the major technique of data 
collection and more than one interviewer was used. 3 

However, there are certain precautions to be observed in the use of 
the replicated sampling. Firstly, each of the sub-samples’ has to bea 
random sample of the whole, otherwise they cannot be considered as 
comparable and difference between them as an unbiased estimate of 
between-interviewer-variation cannot be taken. For example, it would 
Not be good if for the purpose of taking iwo samples the whole population 
is divided according to sex and samples are taken one from each sex 
group. Division of the population on the basis of Arts group and Science 
group, rural and urban setting or some other criteria and selection of the 
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samples one each from the two groups will also confound the results. The 
difference between the results of the two sub-samples would not 
constitute a criterion of the difference between interviewers. It will then 
Not be possible to know how much of the difference was due to special 
groups and how much-to interviewers. 

Secondly, since the numbers in the sub-samples tend to be small, 
detailed investigation of the interviewer errors is rarely possible, only the 
major sources of variation being discovered. 

The systematic errors common to 
compensating errors will notbe discl 
in the separate sub-sam ples and the 
an interviewer's assignment. Hen 
valuable means of investigating no; 
аз à substitute for careful fi 

Another important dei 
sampling pertains to the n 


all interviewers and the 
osed. The former will appcarequally 
latter will cancel each other outover 


may be employed and each sub- 
terviewer. On the other hand, if it 
timates of standard errors, more 


3 (Primary Sampling Units) to be 
selectedand ten replications A lication mustcontainat least one 
Taust be at least ten PSU’s selected from 
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патио available. Secondly, in single stage sampling in the case of 
000 PSU’s and ten replications, there can Бе as many as 100 strata which 
would nearly be more than sufficient for the purpose. 


L 


i 
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Important New Ideas 


CLUSTER 

CLUSTER RANDOM SAMPLING 
TWO STAGE SAMPLING 
MULTI-PHASE SAMPLING 


MULTI STAGE SAMPLING 
PRIMARY SAMPLING UNITS (PSU) 
REPLICATED SAMPLING 


POST STRATIFICATION RATIO 


NON-PROBABILITY SAMPLING METHODS 


6.1 Introduction 
SAMPLES which are selected through non-random methods are called non- 


probability samples. Depending on the technique used, these are 
sometimes called accidental, incidental, purposive, convenience and 
quota samples. Since the term non-random is the most expressive of their 
nature these may be designated as non-random samples. Themain feature 
of these samples is the lack of control of the sampling error. Hence these 
can also be referred to as “uncontrolled” sampling methods. Since the 
selection of the units in these samples is based on judgement and not on 
equal or known probability, the same as a class, are known as non- 
probability sampling methods. In this chaptera description of the various 
non-probability methods of sampling is given. It is followed by a critical 
appraisal of the same. 


6.2 Incidental Sampling 
The term “incidental sampling” (also called accidental sampling) is 
generally applied to those groups which are used chiefly because they are 
easily or readily obtainable. А researcher employed in a school of 
Psychology may 


use the students enrolled in that department; a professor 
of education may undertake a study 


using the students enrolled in 

education classes and a research worker may use the children of the local 

schools or animals available in the local laboratory. These subjects are 

available in number, and under conditions none of which may be of the 

experimenter's choosing. Such casual groups rarely constitute random 
samples of any definable population. 

The main considerations in terms of merits of such a procedure are: 

the administrative convenience of obtaining subjects for the study, the 

ease of testing, saving in time and com pleteness of the data collected. The 


70 Better Sampling : Concepts, Techniques and Evaluation 


demerits of such a procedure lie in the fact that since there is no well 
defined population and no random method of selecting the sample has 
been used, standard error formulas apply with a high degree of 
approximation if at all to incidental samples and no valid generalizations 


can be drawn. Any attempt at generalization based on such data will be 
misleading. 


6.3 Purposive Sampling or Expert-Choice Sampling 
Samplesaresometimes expressly chosen because, in thelightofavailable 
information, these mirror some larger group with reference tooneormore 
given characteristics. The controls in such samples are usually identified 
as representative areas (city, county, State, district), representative 
characteristics of individuals (age, sex, marital Status, socio-economic 
Status, race) ortypesof groups (school administrators, school counsellors, 
elementary teachers, Secondary school teachers, college teachers, editors 
of dailies, housewives, visitors toa religious shrine). These controls may 
be further subdivided by specified categories within classes such as 
amount of training, years of experience, or attitudes toward a specific 
phenomenon. Up to this Stage, these controls are somewhat similar to 
those used in stratification. The researcher may select the number of cases 
in the sample proportionate to the total number of such cases with 
“control” characteristics in the population. It is also important to know 
whether these proportions were based on adequate information or scant 
and partial information about the population. 

Some examples of; purposive sam 
newspaper editors reflectin 
Social and economic 
housewives representin 
sample of brokers refl 


pling in social science research are: 


In another approach, experts choose "typical" or "representative" 
cases on the assumption that with judgement and reason a satisfactory 
sample can be chosen. For example, a teacher selects a few students as 
typicalorrepresentative of the Studentbody asa whole; aprincipal selects 
a typical teacher to be interviewed by the press. Sometimes experts are 
used to select atypical cases. 
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Purposive sampling differs from stratified random sampling in that 
theactual selection of the units to be included in the sample in each group 
is done purposively rather than by random method. The former is not a 
satisfactory procedure if high precision is required. There is absolutely no 
reason to believe that important characteristics are representative of the 
population when two or three characteristics are representative. А 
considerable research evidence has been put forward against such a 
belief. From the statistical point of view also this technique is particularly 
weak. There is по way of calculating the limits of permissible error, or the 
required number for the sample, if strict probability sampling is not used. 

The purposive sampling approach may be useful where it is 
necessary to include a very small number of units in the sample. Thus if 
one were faced with the problem of finding one district or even five or 
perhaps a dozen districts to represent the whole of a big country, 
purposive selection might be the best approach. A sample of one district 
ог five districts cannot ordinarily be found to represent a big country on 
a number of characteristics unless the investigator was extremely 
fortunate, or unless the answer to the problem was known for all practical 
purposes before the samples were selected;or unless thereisno variability 
between areas in the desired characteristics. 

Hansen, Hurwitz and Madow define the use of purposive sampling 
in certain special situations in the following words : : 

A method based on purposive selection is biased, but the biases 

probably would be smaller fora sample of one county selected to 

represent the United States, than the random errors would be in 
measurable method that depended on a random selection ofa single 
county. On the other hand if the sample is to include a considerable 
numberof units, then the biases of these purposive methods often will 
be more serious than the random errors introduced by the methods 
discussed in this book in which random or chance selection rather 


than purposive selection is used.' 


6.4 Quota Sampling : s 
Quota sampling is another non-probability sampling method. It о 
the selection of the sample units within each stratum orquota, on Us |. 
Of the judgement of the interviewers rather than on calculable chance о 


being included in it. | 
* A wide variety of procedures go under the name of quota sampling, 
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but what distinguishes them all fundamentally from probability sampling 
is that, once the general breakdown of the sample is decided (e.g. how 
many men and women, how many people in each age group and in each 
‘social class’ it is to include) and the quota assignments are allocated to 
interviewers, the choice of the actual sample units to fit into this 
framework is left to the interviewers. Quota sampling, 
method of stratified sampling in which the selection with 


random. It is this non-random element that constitu 
weakness,”? 


For example, let us Suppose that a national opinion survey based on 


quota sampling method is to be conducted. The first Step will be to stratify 


by region, by rural/urban area and perhaps by administrative districts or 
Constituencies within these broad 


in its initial stages, is similar to 
although not necessarily, 
initial stages of selection in 


is, therefore, a 
їп strata is non- 
tes its greatest 


controls. While age and sex are easy to 
define social class. Firstly, there is no rel 
the quotas because the definition of 5 
combination of objective factors such 


тау be supplemented by special 
housewife, head/not head of the famil 
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status. However, the use of many extra controls may make the 
interviewer's task difficult. 

On the issue of quota sampling versus probability sampling, Moser 
and Kalton say "Some experts hold the quota method to be so unreliable 
and prone to bias to be almost worthless; others think that, although it is 
obviously less sound theoretically than probability sampling, it can be 
used safely on some subjects; still others believe that with adequate 
safeguards quota sampling can be made highly reliable and the extra cost 
of probability sampling is not worthwhile. In general, statisticians have 
criticized the method for its theoretical weakness, while market and 
opinion researchers have defended it for cheapness and administrative 


convenience.'? 
Rummel points out the weakness of the quota sampling method in the 


following words : 

“ In quota sampling an interviewer is given a quota of cases he is to 
select bearing each of several pre-determined characteristics similar to 
those involved in purposive sampling but he is given considerable 
freedom in choosing the individual cases. This may lead to large biases, 
since those whoare often availableat the times and places the interviewer 
meets them are not representative of the population. Often times 
interviewers have used the line of the least resistance in meeting their 
quotas and have collected information only from those who were easily 
available for interview and have ignored people in areas more difficult to 


reach. Poor supervision of interviewing teams and inadequately defined 


controls have led to results that have caused serious criticisms of the 


method."* 
Another important and с 
end with representative samples 0 


through a comparison of the occupation \ 4 
comparable surveys, one based on anational quota sampling and the other 


оп a national random sample, illustrate as to how very large differences 
in the proportions may be introduced. The occupation distributions 
obtained through-the two types of sampling methods are given on next 


Page : 


rucial issue in quota sampling is—does it 
f the population. Moser and Stuart’, 
distribution achieved by two 
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Table 6.1 


Percentage Distribution by Occupation/Industry 
Quota and Random Samples 


Occupation/Industry Men Women 
Quota Random Quota Random 
Sampling Sampling Sampling Sampling 


Manufacturing 6.5 24.9 43 T2. 
Clerical 3.8 5.0 42 4.6 
Distributive 15.7 5.9 9.0 3.0 
Transport and Public 

Service 18.3 7.6 13 — 
Professional and 

Managerial 18.1 20.0 5.4 3.2 
Mining and Quarry 

making 14 46 — — 
Building and Road 

Making 14.3 6.3 0.2 — 
Agriculture 24 2.8 0.3 0.8 
Other Industries 15.5 8.1 10.3 5.5 
Housewives — 2.0 64.8 69.2 
Retired, unoccupied, 

Part time 40 12.6 0.2 6.6 
Not Stated — 0.2 — — 
Total 100.0 100.0 100.0 100.0 


Source : Quota sample— British Market Research Bureau ; Random 
sample-Government Social Survey. 


The table shows a large excess in the quota sample of persons 
employed in distribution, transport, the public services, and buildin gand 
road making. A correspondingly small proportion of persons are 


= —— - 
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employed in manufacturing. The random sample proportions were found 
to be broadly correct in comparison with data obtained through other 
Sources. 

Before the discussion of the quota sampling method is concluded it 
would be appropriate to sum up the main points made so far. The quota 
method involves the use of strata, but selection within the strata, is not 
doneonarandom basis—the field worker merely fillsa quota by securing 
the correct proportion per stratum. The chief merits of the quota sampling 
are : firstly, the method is less expensive. However, the greater the 
controls involved, the more expensive is the quota sample, but the smaller 
will be the risk of selection biases. While no accurate comparisons of the 
cost are possible, it is generally suggested that in quota sampling an 
interview costs, on the average, only a half or a third as much as arandom 
interview. It is so because in the former, there are no call-backs and the 
field worker has not to travel all over the town to track down pre-selected 
respondents. Secondly, administrative convenience is another advantage 
of quota sampling. The labour of random sample selection and the 
headaches of non-contacts and call backs are avoided. Moser and Stuart 
point out that—although there are no non-contacts in terms of pre- 
selected respondents, refusals in one study were found to be 8 percent. 
Thirdly, quota sampling is the only practicable method of sampling a 
population for which no suitable frame is available. In a market research 
study concerned to find out the characteristics of smokers ofa particular 
brand of cigarette, the population comprises only a small proportion of 
the total population, is sparsely spread over the country and is not listed 
onits own and hence no suitable frame isavailable. Quota sampling is the 
only practicable way to samplesuch populations. Lastly, in studies where 
the field work has to be done quickly to reduce memory errors quota 
sampling is the only answer. In audience research surveys conducted by 
various broadcasting concerns, 10 obtain the public reaction to their 
programmes, sometimes to the previous day's programme-quota 
sampling is the only method to contact à national or provincial sample. 
Sudman”* reports survey of immediate public reaction to President John 
F. Kennedy's assassination completed in about ten days, through quota 


sampling. 

The main criticism of q 
points, Firstly, that unlike ran 
estimates of standard errors to 


uota sampling is based on the following 
dom sampling it is not possible to attach 
the quota sample results. The use of 
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replicated method for the purpose will be very costly and difficult to set 
up. Secondly, the method does not allow for an easy supervision of the 
field workers. Thus the correctness of the datacollected remains doubtful. 
Lastly, although quota samples generally claim that instructions to, and 
constraints on field workers are sufficient to guard against the main 
dangers of selection bias, but it is a matter of belief rather than facts. 
Hence itremains doubtful whether quota sample was representative of the 
population, 

Attempts have been made to make the quota sampling more 
scientific through a combination of random and quota methods. Some 
quota samplers issue exact instructions to the interviewers for following 
a standard procedure. They may be told to contact every third house and 
interview one person until their quota was filled. Sudman®™ applied 
detailed geographical controls and prescribeda specific travel pattern for 


the interviewers. The modified method is called "probability sampling 
with quotas," 


carrying outastudy 
arry out, for instance, a 
е hundred cases each. As 
y is dropped, may it be at 
onclusive results have not 


This he might do by testing, 
be significantly superior or 
ulbs, he would then have his 
inconclusive, he would then 
provide a conclusive answer; 
addition of one batch of fifty 
d one way or the other and at a 
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The design can incorporate random and non-random techniques 
depending on the requirements of a particular study. The four batches of 
fifty bulbs each can be chosen randomly from one lot of 10,000 bulbs 
produced. These batches could be the first 200 produced by the 
manufacturerand further production suspended until a conclusive answer 
to their superiority or inferiority to the conventional bulbs was obtained. 
Another strength of this procedure is that if a basic flaw is noted in the 
design of the study, the first stage could be considered a pilot study of the 
others which would then be conducted on the basis of the improved 
design. Considerable economies in terms of time, labour and expenditure 
can be made more so, in case of research on very precious articles of 
production. 


6.6 Snowball Sampling 
The term “snowball sampling" has been used to describe a sampling 
procedure in which the sample goes on becoming bigger and bigger as the 
observation or study proceeds. For example, an opinion survey is to be 
conducted on smokers ofa particular brand of cigarette. At the first stage, 
we may pick up a few persons who are known to us or can be identified 
tobe the smokers of that brand. Atthe time of interviewing them, we may 
Obtain the names of other persons known to the first stage subjects. Thus 
the subjects go on serving as informants for the identification of more 
Subjects and the sample goes on increasing. The term snowball stems. from 
the analogy of a snowball, which begins small but becomes bigger and 
bigger as it rolls downhill. | 
Tenhouten et al? have developed a strategy for drawing a 
probabilistic snowball sample which would allow computation of 
estimates of sampling error and use of statistical tests of significance. 
Snowball sampling which is generally considered to be nonprobabilistic 
can be converted into probabilistic by selecting subjectsrandomly within 
each stage. For a nonprobabilistic sample, some method such as quota 


sampling can be used at each stage. 


6.7 Dimensional Sampling 

It is a variant of quota sampling 
multidimensional form of quota sam 
interest to the researcher in the popu ач 
that every combination of these dimension 
case, The procedure is as follows n 


and can be viewed basically as a 
pling. All dimensionsor variables of 
lation, are specifi ied and it is ensured 
s is represented by at least one 
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1. Explicitly delineate the universe to which you eventually wish 
to generalize; 

2: Spell out whatappear to be the most important dimensionsalong 
which the members of this universe vary and developa typology 
that includes the various combinations of values on these 
dimensions; and 

3. Use this typology as a sampling frame for Selecting a small 
number of cases from the universe, typically drawing one case 
from each cell of the typology. 

The basic purpose is to eliminate the Possibility of non- 
representation of any combination in the Sample. The hypothetical 
example given below may serve the purpose of further clarification. 
Suppose, a Survey of the opinion about the functioning of the present 
congress (I) government in India is to be conducted, The degree of 
Commitment to Congress ideology and whether this commitmentis intact 
Ог not are two important dimensions or variables of interest to the 
researcher. The typology can be diagramed as below : 


Whether intact Degree of Commitment to 

or Not Strong Moderate 

Intact Congress (Т) Leaders of 
Leaders other parties 

Not intact Former Congress Former Leaders 
Leaders 


of other parties 


Fig. 6.1 Typology in Dimensional Sampling 


Dimensional Sampling ensures that each of 


К these four cells is 
represented even in a very small sample. 


6.8 General Evaluation 


An interesting experiment described by Yates illustrate 
of the judgement selection in a rather striki 
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collection. The results showed the following types of divergences from 
representativeness of the samples from the population. 

1. Consistently nearly all observers over-estimated the average 

size of the whole collection. 

2. Only 6 of the 36 estimates were smaller than the true average 

weight and 3 of these were made by a single judge. 

3. The judges selected stones as near their concept of average size 

as possible. 

4. The proportion of the extreme size selected was much smaller 

than would be obtained in a random selection. 

To sum up, the experiment demonstrated that in judgment sampling 
there was a consistent tendency on the part of the judges to over-estimate 
the average size and under-estimate the variances. 

No statistical theory has been devised to measure the reliability of 
sample results by purposive and other non-random sampling methods. In 
some instances, however, it may be practically impossible to use 
probability sampling such as in drawing a sample of fish from the sea, or 
а sample of a specific type of wild life from the forest. Sometimes 
although the sample may be designated as probability sample, yet the 
incompleteness of the responses due to refusal and non co-operation of 
the respondents even after several attempts at persuasion, may take away 
the quality of randomness from the sample and measurementofreliability 
Of the estimates may become impossible. The problem of non-response 
has been discussed in detail in the chapter to follow. 
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Important New Ideas 
INCIDENTAL SAMPLING QUOTA SAMPLING 
ACCIDENTAL SAMPLING TYPICAL CASES 
UNCONTROLLED SAMPLING SEQUENTIAL SAMPLING 
CONVENIENCE SAMPLING SNOW BALL SAMPLING 
PURPOSIVE SAMPLING DIMENSIONAL SAMPLING 


EXPERT - CHOICE SAMPLING 


TYPOLOGY 


7 


SOME OTHER PROBLEMS 


7.1 Longitudinal Studies 

IN LONGITUDINAL studies, the aim is to collect data from the same sample 
on more than one occasion. The first Step is to obtain a sample through 
any of the probability sampling techniques. The second step and the more 
arduous one is the maintenance of the sample representativeness, The 
information is then sought from this sample at intervals. The sample thus 
used is called panel and hence in some books on research this method is 
referred to as “panel studies”, 

The technique is specially useful in experimental and pre-experi- 
mental designs’. In such studies, the effects of specifically introduced 
measures such as advertisement, a specch and the like are explored. The 
“before-after design” 


and Stanley? has several inherent weaknesses of internal validity. How- 


several observations ( О, 0,,0,,andO, 


in the first four or the last four observations, or if there was a steady trend 
in each group of four and if there was disco’ 
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invalidity. However, the discontinuity in O, and O, cannot be easily 
accounted for as "history" still remains a major source of invalidity 
because of the possibility that some event other than the predictor, 
occurring between О, and О, caused the discontinuity. Another advan- 
tage of taking multiple observations is to detect the transient effect 
because in a study predictor's effect might decay with time or it has its 
maximum impact some time after its administration. 

Another advantage of the panel study is greater precision. It nearly 
always measures changes with greater precision than does a series of 
independent samples of the same size. The variance of the change is lower 
for a panel study than for completely independent samples. The tech- 
niques of matching and blocking in experiments also serve this purpose 
to some extent by introducing positive correlation between matched pairs 
Or groups. 

Moser and Kalton point out another advantage of panel studies in the 


following words : | 
They enabie the researcher to measure not only net changes (which 


independent samples could do also) but in addition to identify and study 

the ‘changers’. The data can be conveniently set out in a turnover 

table——*.” | , 
These authors have given a turnover table which contains two 


examples. 
Table 7.1 


Examples of a Turnover Table 


Example 1 

Period 1 Favour x Period 2 Percent 

Do not 

Favour x Total 
Favour x 48 0 48 
Do not : 
Favour x 6 46 32 

46 100 


Total 54 
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Example 2 
Period 1 Favour x Period 2 Percent 
Do not Total 
Favour x 
Favour x 20 28 48 
Donot 
Favour x 34 18 52 
Total 54 46 100 


In both examples, 48 per cent favoured brand X in period 1 and 54 
per cent favoured it in period 2, but there is nevertheless a marked 
difference between the two sets of data. In Example 1, there are very few 
changes, with rio one ceasing to favour Brand X and just 6 per cent 
changing to favour Brand X in period 2. In Example 2, on the other hand, 
although the net change is the same 6 per cent increase in those favouring 
Brand X-thereisa great deal of change with 62 per centof the respondents 
changing theiranswer between the two periods. By means of the turnover 
table, the panel method makes it possible to measure gross as well as net 
changes. 

Relationships between questions asked at the same time be 
investigated from a single survey, but the relationship on an individual 
basis between the answer to questions asked at different times, often of 
great interest in social research, can only be investigated through panel 
study. 

Another important factor in causal analysis is the temporal ordering 
of variables. A cross sectional study may finda relationship between two 
variables such as worker's attitude towards his job and his position in the 
firm, but it cannot indicate which came first. A panel study recording 
promotions and measuring workers' attitude at intervals would provide 
data for distinguishing between these two possible explanations of the 
relationship. 

There are certain administrative advantages of the panels also. The 
costs of sample selection is reduced and planning of field work can be 
effective and easy. In the case of full co-operation from the respondents, 
fuller and more reliable data can be obtained. 
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The main problems in panel studies аге: 
(i) Achievement of the initial sample, 
(ii) Sample mortality and 
(ii) Sample conditioning. 

The recruitment of a representative panel of respondents to provide 
detailed information regularly at set intervals is a difficult task. A 
housewife may agree to answer a few questions on one occasion but may 
refuse to provide detailed information about her monthly purchases, use 
of various vegetables in kitchen, a list of entertainments the family 
enjoyed in a month, the category of guests she had to entertain at her 
house, and the like for extended periods of, say, a year or two. 

The problem of mortality in such studies arises in more than опе 
ways. It is the experience of many researchers that the first few reporting 
periods (weeks, months, or quarters) always take a heavy toll of panel 
membership. The rate of mortality decreases afterwards and the member- 
ship settles down. 

Wordsworth’ Le Mcasuricr® and Ehrenberg’ have described 
Attwood Consumer Panel which was required to record purchases 
covering a wide range of non-durable goods in a weekly diary. Initially 
80 per cent of the contact agreed to be enrolled on the pancl but by the 
time of the “first” reporting period 16 per centof these failed to co-operate 
thus leaving only 64 per cent of the sample. The Attwood panel lost a 
further 16 per cent of the total and thus 48 per cent of the original 
membership settled finally. Although the drop-outs were replaced by 
houscholds with similar demographic characteristics yet it was difficult 
to ascertain as to how typical those who dropped out or refused to 
cooperate were of the remainder. The drop-outs might have been less 
literate, busier, had bigger families and less interested in the sample and 
so оп. Sobol’ has reported the panel mortality and panel bias ina five-way 
study of economic attitude formation and change, plans to buy cars and 
durable goods, in fulfilment of these plans. The study was conducted by 
the Survey Research Cenue, University of Michigan. The respondents 
who drop out during the life of the panel can be compared with those 
remaining in terms of their earlier responses. The composition of the 
remaining sample can be checked against known data on some factors. 
But it may provide, at the most à partial reassurance. Its 
representativeness with regard to the behaviour and attitudes that are the 
subject of survey must remain in doubt. In panel studies, the greater the 


burden on respondents, the bigger, the problem of drop-out and refusals. 
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The second source of the panel mortality is the difficulty of tracing 
the respondents, especially when the interval between the interviews is 
as long as a year or more. Some of them will move house and it may be 
difficult to trace them to their new address. However, methods like 
checking with the post office for their forwarding addresses, consulting 
telephone directories and contacting their closest relative whose address 
had already been obtained may help in tracing the respondents. 

Depletion of the panel through deaths and emigration is another 
hazard. But these losses represent natural decrease in the population and 
do not cause a bias in the panel. The panel survivors are a representative 
sample of the surviving population. However,a supplementary sample of 
the new births (or people reachin g the age set for the defined population) 
and new immigrants is required to make the panel representative of the 
full current population. 

Repeated interviews with the panel members may lead to “panel 
conditioning”, thus making it untypical, in characteristics although not in 
composition, of the population it was selected to represent. Members of 
а radio listening panel may gradually become more critical of 
programmes, more interested and more attentive. Conditioning may also 
change the accuracy of reporting. Through experience improvement in 
memory may take place and thus lead to a more accurate recall of events. 


However, in cases where detailed records are required, completeness of 
the recording may decrease over time through fatigue. 


7.2 Master Samples 

Some studies require repeated sam 
such Cases, a master sample ma 
be taken as and when required. 
: simplicity of selection ргосед 
Sample as and when required, 

King and Jessen?,° have described the US Master Sample of 
Agriculture for which thecounty was divided into alarge number of small 
areas and a master sample of 70,000, representative of every county, was 
constructed. Sub-samples from this could be drawn to give whatever 
regional or national Coverage was required for a survey, 

However, it is important that the sample units into which a master 
sample is divided should be relatively permanent. A master sample of 
individuals or households will quickly become out-dated due to deaths, 
birthsand removals. A master sample of dwellings will berelatively more 


ples of the same area or population, In 
y be prepared from which sub-samplescan 
е main advantages of this procedure are 
ures and greater speed in obtaining a 
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permanent and hence of more long-term usefulness. 


7.3 The Problems of Non-response 

The term no -response may be defined to refer to the failure to measure 
some of the units in the selected sample. Non-response is a problem no 
investigator of human populations can escape. He can never get 
information about more than a part of the survey material. This is true of 
complete coverage studies as well as of sample surveys. Non-response 
does not include the units that fall outside the population. The sampling 
frame may include units that field investigation reveals to be non-existent 
Such as demolished houses, non-existent addresses and the dead. These 
units are considered to be blanks on the sampling frame and not as non- 
response. It is, therefore, advisable to subtract the number of blanks from 
the sample size before calculating the non-response rate. These blanks 
may be filled by substituting randomly selected units in order to keep the 
sample up to the required size. 

(i) Sources of Non-response 

Non-response occurs through various sources. Some of the more 
important of these are given below : 

(a) Unable to answer : The respondents may not have the 
information sought for or may refuse to answer due to some extrancous 
fears. The sample is likely to include some people too infirm, deaf or 
unfamiliar with the language to be interviewable. Ifarespondent puts the 


e of illness or lack of time, here is a case for 
leaving him as 


interviewer off with an excus 
calling back at another time and пог for | 
“Uninterviewable”. 


(b) The Unaccessible : This category consists of those units in the 


sample which could not be located or visited due to certain уно a 
as use of incomplete lists, bad weather and poor дна Со acili a 
(с) Not-at-homes : This group contains two types o proud 
households-those who remain away from home for longer than the i 
Work period, so that ordinary re-calling is inapplicable and (рео 
Teside at home but аге temporarily away from the house xs wd 
interviewer calls and can be contacted by re-calling. nori us оп 
inconvenient at one time and on whom а callback can be т 


included in this category. J б 

1 lation census carry compulsory 

d) Refusals : Surveys like popu при! 

es Viene to pi is punishable. In aei ru 
Some people do adamantly refuse 10 answer for one 
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This section of non-respondents can be designated as the “hard core". It 
represents a source of bias in the results because of the incompleteness 
of the returns. 

All the sources of non-response mentioned above apply to random 
sample surveys in which the units are pre-selected and to be interviewed 
at home. Where interviewing is to be done in factories, schools, colleges, 
offices and so forth, or a main questionnaire is to be used, some of the 
abovecategories donotapply. Inquotasampling, failures to interview are 
not recorded since the interviewer simply goes on to the next house or 
unit, 

The magnitude of the problem of non-response due to various 
Sources is likely to differ from survey to survey. It further depends on the 
study, the number of questions to be answered and the category of persons 
to be covered. A survey of Personal incomes or saving is likely to generate 
larger non-response than a survey of food habits. In straightforward 
interview surveys where answering a few questions is required, the non- 
response will be lower than in those which require keeping of lengthy and 
detailed accounts, In surveys of the latter type a response rate of 70 per 
centisconsidered quite good while in the former, atypical rate is probably 
nearer to 80 per cent. Moser and Kalton say, 

“We may assume that, in the average interview survey, there will be 
no information through non-response from something like 10 to 25 per 
cent of the selected sample, and there is much evidence in the survey 
literature that non-respondents and respondents differ in important 
respects. Housewives with large families are more likely to be found at 
home than those with few or no children ; day time interviewing will fail 
to find many young people and working menathome: keen cinema gocrs 
and greyhound enthusiasts are less likely to be found at home in the 
evenings than others. Nor is willingness to cooperate likely to spread 
evenly over the whole population," 

Several attempts have been made to find out the differences in 
response rates due to sex, age social class and other variables. Moser and 
Stuart"? report the following refusal rates in a quota sampling survey. 
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Table 7.2 


Refusal Rates in an Experimental Survey-by 
Sex, Age and Social Class (Percentages) 


Sex Age Group Social Class 
Male 6.1 20-29 4.6 Uppr 122 
Female 8.9 30-44 6.1 Middle 97 

45-64 11.0 Lower 63 
65+ 7.5 


Differences in the refusal rates between male and female, age group 
45-64 and each of the other three, and upper and lower social classes are 
striking. 

(ii) How to-Deal with the Problem ? 

Several approaches to the problem of non-response have been 
suggested in the literature on survey research. Some of the more 
important of these are described below : 

1. Substitution 

The non-respondents can be replaced by respondents and the 
sample kept up to the desired size. It would ensure adequacy of numbers 
for the intended analysis and keep sampling errors to their estimated 
magnitude. It is useful also in situations where the strata are not 
represented in the achieved sample in the planned proportions. In such 
situations, as far as possible, the substitutes should be taken at random 
from a reserve list drawn together with the initial sample. However, the 
risk of bias is not altogether eliminated. The danger arises from the 
possibility that non-respondents dif! fer significantly from respondents. 

2. Increasing the Response Rate 

(a) Response rate for different types of non-response can be 
increased through a skilful handling of the situation. A particularly good 
interviewer may manage to obtain response from the partially deaf, the 
partially unable-to-speak a particular language, and the less determined 
to answer. 8 n 

(b) Some of the “movers” can be tracked down by obtaining their 
forwarding addresses from the postal authorities or their meighbours. 
Another approach to deal with the problem is to substitute for the moved 
household, the new one that has moved in. Kish"? used, a selection table 
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to define a rigorous procedure for selecting an individual from the new 
household. It reduces the bias due to substitution toa negligible minimum 
and the movers can be correctly represented in the final sample. One of 
the flaws of the Kish Selection Table is that it does not cover movers into 
new houses. For further details, see Kish™* and Moser and Каноп". 

(c) Gray, Corlett and Frankland' have suggested a re-weighting 
procedure for dealing with movers listed on the Register of Electors, The 
Procedure is based on the assumption that the movers-in are 
representative of the movers-out and through the double weighting the 
former are included in the sample on their own accountand also оп behalf 
of the movers-out. 

(d) It is a fact that the refusal rate depends a good deal on the 
Tesearcher’s and the interviewer’s skill. Use of professional interviewers 
can cut down the rate of non-response. Durbin and Stuart", in an 
experiment on the relative efficacy of experienced and inexperienced 
interviewers, found that the experienced professional interviewers had 3 
to 4 per cent refusals against the inexperienced amateurs’ 13 percent. A 
Slight wisdom on the part of the Surveyor can goa long way in minimizing 
refusals, 


(i) The questionnaire should be keptas brief as possible so that the 
burden on the respondents is at a minimum, 

(ii) Inducement of financial rew 
than no rewards. 


(ш) The Sponsorship should be Strong and acceptable to the respon- 
dents. 


(iv) The importance of the surv. 
the respondents and their personal i 
to them. 


ards will bring in more responses 


еу should be clearly impressed upon 
nvolvement, if any, be brought home 


Stuart? report that when an appointment had been made 71 percentofthe 
second calls resulted in inte: 


appointment had been made, 
3. Appointments on Telephone 


The use of telephone for making prior appointment has been made 
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and the success of the method demonstrated. Sudman' showed that 
through making telephone appointment the average number of calls 
required to complete an interview was 1.7 compared with 2.3 calls per 
completed case with no telephone appointment. Scott and Jackson? also 
demonstrated that by making telephone appointments for an interview 
survey, the number of personal calls can be reduced without an 
appreciable increase in non-response. In a country like India where only 
a small section of the population owns telephones, the system is not 
practicable. 

However, appointments made in person or on telephone, may prove 
a double-edged weapon in that some people may make doubly sure of 
being out or not answering the door bell when the interviewer calls. 
Brunnerand Carroll? reporta marked detrimental effect of appointments 
on response rate. 

4. Call-backs 

However sensibly timed and arranged the first calls are, a surveyor 
would be extremely fortunate if he gets a 100 per cent response rate at 
them. Hence successive calls become necessary for completing the 
survey and reducing the bias due to non-response. A standard technique 
is to specify the number of call-backs, or a minimum number, that must 
be made onany unitbefore itis abandoned as "unable tocontact”. Stephen 
and McCarthy”, on the basis of analysis of the results of a number of 
surveys, report the following information about the number of calls 


required for completed surveys : 
Table 7.3 


Number of Calls Required for Completed Interviews 
(Per Cent Sample Contacted on Successive Calis) 


Respondents First Second Third PerCent TOTAL 
Call Call or later Non- 


Call response 
Any Adult’ 70 17 8 5 m 
Random Adult 37 32 23 8 


* “Any Adult” group included 2 housewife and a farm operator 


respectively in two different surveys. N 
Source : Average results as reported by Cochran. 
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It is amply evident from the above table that the problem of 
completing the interview survey is easier when any adult in the home is 
capable of answering the question than in those in which a single adult, 
chosenatrandom, is to be interviewed. The marked success of the second 
call reflects the work of the interviewer in finding out in advance when 
the desired respondents would be at home and available. 

On the use of call-backs two important considerations should not be 
lost sight of, Later calls would be expected to be more expensive per 
completed interview, since the houses are Sparsely located in the area 


assigned to the interviewer. Durbin and Stuart” Teport the following 
figures in this regard : 


Table 7.4 
Relative Cost per New Completed Interview at the ith Call 
Call 1 2 B 4 5 
Relative Cost" 100 112 127 151 250 


* Money spenton ith calls divided by numberof new interviewsobtained. 


Another problem thatarisesoutofcall-backs is the time-factor. Call- 
backs delay the final results. 


5. Sampling in time for Non-respondents 


Another method of. dealing with the problem of the not-at-homenon- 
respondents was suggested by Hartley% 


intervalsoftime, and inquiring of each one found athome as to other times 


specific period of time, For those 


probability of being included in the sample. 
The method will yield consistent estimates if the sampling in time is 
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in fact at random and if the probability of being at home can be 
approximated satisfactorily. In practice serious problems may be in- 
volved in the effort to accomplish this.?" 

6. Comparisons 

A random sub-sample of the non-respondents can be taken and a 
major effort to interview every one in the sub-sample can be made. 
Comparisons of respondents and non-respondents on sex ratio, age- 
distribution, social class grouping and various other characteristics can 
be made to find out if the non-respondents were broadly similar to the 
respondents. While this procedure may give the surveyor some 
confidence on the broad similarities of the two groups, it can never prove 
the absence of bias because it does not mean that the two groups are 
similar in respect of whatever it is the survey is studying. 

Certain procedures have been suggested for adjusting for non- 
response bias. The adjustment is generally made by reweighting the 
results according to the correct proportions of different strata in the 
population. However, what must be clearly understood is the fact that 
rcweighting ensures only that the sexes of different groups are correctly 
represented in the sample, it does not reduce any bias arising from 
unrepresentativeness within the strata, that is within each group. 

To sum up, the aim of a sample is to estimate certain population 
characteristics as accurately as possible and it would be irrationally 
incorrect to leave a bad sample as it is. The methods and procedures 
described in this section can help in reducing the magnitude of the 
problem of non-response and thus raising the accuracy and validity of the 
results. Sometimes, it may well pay to keep the size of the sample smaller 
than what the sampling theory prescribes and spend the resources thus 


freed on securing a high response rate. 


7.4 Sampling Techniques Used in Observation i 
Generally sampling involves persons, objects etc. However, in 
observation studies, sampling techniques may be used for taking out a 
sample of Time’ out of a universe of Times’; a sample of ‘events’, out 
of a universe of ‘events’; and a sample of 'situations' out ofauniverse of 
‘situations’. These methods are not so widely in use in surveys but form 
an important part of the observational techniques. Hence a brief 


description of these techniques is given below : 
1. Time Sampling 


The time-sampling technique requires that one record the frequency: 
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of observable forms of occurrences during a number of definite time 

intervals thatare systematically spaced” The examples of time sampling 

аге: 

G) — Staffmembersmaynotetheongoing behaviour ofa hospitalized 

psychiatric patient every 30 minutes. 

Gi) ^ Wemightinstructthe parentsofachild with behaviourproblems 
to set aside one hour a day to observe and record their own and 
their child's behaviour. 

(i) ^ We might also measure the blood pressure of a patient with 
essential hypertension every five minutes while exposing him to 
stressful stimuli, 

(iv) Observers might go into the home of a married couple three 
times a week at suppertime to observe husband - wife interac- 
tions. 

(v) А teacher or outside observer might record the behaviour of a 
first-grade child for five minutes during recess and lunch to help 
assess his social interaction behaviour. 


The student will find that in all these 


examples the observation occurs 
intermittently, 


at predetermined times and for predetermined duration. 
This is the simplest form of time sampling. Typically a target subject or 
group of subjects is identified; a block of time during which observations 
are to be madcis specified and that blockoftimeis further subdivided into 
smaller sampling intervals for the purpose of recording. Several 
variations of sampling time has been used by researchers. Only 
Occurrence or non-occurrence of behaviours may be noted at random or 
specified times. When block of time are used, the term "interval time 
sampling” is used. In cases, where observation is done atrandom for small 
periods, the term “momemtary time sampling” is used.?* 

In some studies, alternate periods for observing and recording are 
used. A wide range of samplin £ times, durations and schedules have also 
bcen used by investigators in applied behavioural analysis. Parameters of 
sampling period include behaviour rate; behaviour variability over 
Observation sessions; and Situational specificity or variability of 


environmental stimuli controlling the оссштепсе of the behaviour, 
Behaviour rates across observation Ses; 


Sions may be plotted to 
discern the trend. The graph may indicate a 


positive or negative slope. 
Slopescan be detected Statistically, through visual inspection, or through 
slope or trend analysis. 
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2. Event Sampling 

In event sampling, only selected behaviours in the subject's 
respective or selected environmental events are observed. The behaviour 
analyst selects one or a few events from the multitude of potentially 
observable target events. Behaviours are selected for observation as a 
function of several considerations such as 2° 
Nature of the referral problems, 
Hypothesized controlling events, 
Desired alternatives to a targeted problem, 
The desirability of assessing side-effects, and 
The desirability of assessing high rate behaviours which may covary 
with targeted low-rate behaviours. 


tA OO S 


Referred problems may include cases like the following : 

A mother is concerned with a child's tantrums; а teacher is worried 
about а student's aggressive behaviour; and a college student may fecl 
concerned about his attacks of anxicty. 

Specification of referral questions help in determining whether 
observation is possible or some other method would be required. The 
suitability of participant and non-participant observation will also be 
assessed on the basis of this information. The process of event sampling 
includes defining of a universe of targeted events and then to observe a 
sample of them. 


3. Situation Sampling 
Behaviour is considered to be situation specific. Since the 


probability of behaviour frequently varies under different antecedent 
environmental situations, ње should selectively observe a subject in 
situations where the probability of target behaviours is high. 

Situation sampling is probably most frequently used in monitoring 
social behaviours such as family interactions, aggression, social 
interactions between children, and communication between marital 
partners. In these cases, we are interested in monitoring specific type of 
social interactions and, therefore, select situations in which these 
interactions are most likely to occur. These situations may be recess 
periods, lunch time, working on academic assignments, watching 
television, drug ingestion. Anti-social behaviours, lying, stealing and 
some psychotic disorders sometimes do not lend themselves readily to 
situation sampling. However, high rate behaviours like asthmatic 
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attacks, depressive talk, migraine headaches, hallucinations, aggression 
social withdrawal etc., are easier to sample. | 

The three types of sampling discussed above lead to a loss in 
generalizability ofthe data. Applicability of inferences from time to time, 
event to event and situation to situation remains impossible, 


, 
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APPENDIX II 


HOW TO USE RANDOM NUMBERS 


Tre SAMPLING theory often assumes that the sample at hand was drawnat 
random. Researchers very often try to accomplish this target. Yet actual 
accomplishment of randomness is extremely difficult by ordinary 
methods. The tables of random numbers, however, have been prepared 
by statisticians to facilitate the process of random selection. These tables 
have been so prepared as to allow an equal probability of selection to the 
numerals, 0-9. 

The pre-requisite of the use of these tables is that it is practicable to 
count the observations and alloteacha serial number. These observations 
may be anything such as observations located ina numbered file or ina 
book with numbered pages and lines or lists of names of schools or 
students, houses, strects, workers in a factory, shops in a town, trees on 
a tract of land and the like. A series of random numbers is sclected from 
the table equal to the number of observations desired. Thc observations 
with the corresponding numbers are used. The columns of digits may be 
combined to give rise to numbers containing the requisite number of 
digits. 

For example, it is desired to draw a sample of 200 cards containing 
specific in formation from numbered file containing 5675 cards. Since the 
card numbers involve a maximum of four digits, groups of four numbers 
may be used. Each of the tables given in the pages to follow has 40 
columns and 25 rows. The tables can be used vertically or horizontally 
or diagonally. 

The starting point can be at the begi 
point located by throwing a pointed pencil а 
digit close to the dot thus made by the tip of the pencil. In the example 
cited above, suppose itis decided to use the tables vertically, the first four 
columns are to be used. From the first thousand numbers, the first number 
selected is 2,315 (row 1, Column 1-4). The card with this number is 


nning of the table or any other 
on the table and picking up the 
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withdrawn from the file. The operation continues until the desired 200 
cards have been obtained. After finishing the first four columns, numbers 
from the next four (5-8) columns be selected. If a number exceeds the 
largest number in the series i.e., 5675 in our example, it can be ignored 
or the largest multiple of n, (in this case 200) that can be subtracted from 
it, may be deducted from it and the remainder is the desired number card 
selected. 


The steps in the use of tables of random numbers can be enumerated 
as below. 

1. Giveaserial number 1-N or 0-(N-1) tounits of the population. Incase 
of populations containing 100, 1000, or 10,000 or 10,00,000 cases, 
the following plan of numbering is more convenient and takes less 
time and labour in picking up the required number of cases from the 


table. 
Population Size Scheme of Serial Numbers 
100 = 00—99 (All numbers have two digits; — 
use two columns) 
1000 = 000—999 (All numbers have three digits; 
use three columns) 
10000 = 0000—9999 (All numbers have four digits; 
use four columns) 
100000 = 00000- 99909 


(All numbers have five digits; 
use five columns) 


Thus in the selected columns there will be no number which will be 
larger than the largest in the series, The number of columns thus 
required will be less by опе than what is required in the scheme using 
serial numbers 1—100, or 1—1000 etc, 
Select a starting point as explained above, 
3. Select the number of columns equal to the number of digits in the 
largest number of the series, 

4. Go on selecting the numbers ina 

horizontally or diagonally. 


ы 


uniform way, vertically or 


5. After finishing one column 80 to the next adjacent column, 
6. Ifa number appears again, it may be ignored. 
7. 


For different samples, start from a different point, 
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Kendall and Smith warn that "it was почит... that the use of the same 
set of random numbers over and over in the same sampling experiment 
was illegitimate....”". For this reason repeated samplings of the same 
population requiring more than the total numbers given in íhis book, 
should be accomplished by using more extended table such as may be 
found in Kendall and Smith?, and Tippet?. 
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APPENDIX III 


A RATING SCALE FOR SELF EVALUATION 
OF SAMPLING PROCEDURES APPLIED 
IN A RESEARCH STUDY 


Section A 

PLEASE answer cach question on the basis of your own study. In case you 
do not remember everything you may consult your research report or 
other related records. Since it is а self-evaluation scale for your personal 
consumption and improvement if required, honesty and precision in 
answers will be self-rewarding. (Please use a tick mark for answering). 


1. Have you defined the population or universe 

of your study ? Yes No - 
2. Have you fixed up the limits of the 

population in terms of geographical, 

demographic and other relevant criteria? Yes | No. 
3. Can you say with full confidence that a 

particular unit or element belongs to 


your population ? Yes No . 
4. Have you ап assured access to all elements 

of the population ? Yes Na 
5. Is the sampling frame available or can it 

be constructed easily? Yes Мо . 


6. Have you checked and ensured that the 
sampling frame is uptodate and correct 
and does not have any omissions or 
duplications ? Yes № 


16. 


. Is your sample siz 


Have you made sure that the sampling 
frame does not exhibit any periodicity 
or other arrangements of elements ? 
Have you given a serial order to the 
elements or listed them up? 

Does the sampling frame contain 
elements according to the sampling 
unit such as individuals or groups or 
blocks etc.? 


. Have you decided upon the sampling 


unit ? 


. Does your sampling unit coincide with 


the experimental unit or unit of data 
collection and analysis ? 


. Have you decided upon the size of the 


sample you wish to have? 


. No personal judgment or intention has 


any place in the decision about the sample 
size ? 


. Is the decision of sample size based on 


some sound statistical considerations ? 


. Does the decision about sample 


size take care of the homogeneity 
or heterogeneity of the population ? 

Have you decided upon the 
magnitude of the error or discrepancy 
acceptable to you in deciding about the 
size of the sample ? 


. Is your sample large enough to be adequate 


for the purpose of the study ? 
e large enough to be 


representative of the population ? 


. Are you using а probability sampling 


design ? 
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20. Have you specifically mentioned the 
randomization technique used in your 


study ? Yes. 


21. Have you used tables of random 
numbers or some good mechanical 
device or computer services for the 
selection of the sample out of the scrially 


arranged sampling frame ? Mes. 


22. Did you stratify the population and 
obtain/prepare sampling frames for 


sub-populations ? Wes 


23. Were the criteria of stratification based 
on a sound evidence of their known 


relationship with the variables of the study? — Yes — 


24. Did you check whether the stratification 
criteria had no known relationship 
between/among themselves ? If yes, 
did you drop some of them ? Yes. 


25. Ате you giving proportional/optimal 


representation to all stratification criteria ? Wes. 
METHOD OF SELF SCORING 
Section A 


No__ 


Allota credit of 1 to each ‘Yes’ ticked, and a credit of 0 to each ‘No’ 
ticked. Add up the scores. The maximum | score can be 25 and the 
minimum possible score is 0. Now compare your score with the following 


to obtain a qualitative description. 
Sampling 
Very Good Average Poor Very 


good Poor 
21 and above 16-20 11-15 6-10 1-5 


Section В 

Reporting Stage 
How adequate and clear is your reporting ? Have you in your research 
report mentioned the following clearly and precisely ? Use a tick mark 


based on your own assessment of your reporting. 


Aut 


3 
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Very Somewhat 
clearly 


Population and its characteristics. 
Sampling frame and its sources. 
Sampling Unit. 

Precise Method or design of 
sampling such as simple random, 
cluster random etc. 

Fraction of the population taken 
as sample. 

Size of the sample. 

Composition of the sample in 
terms of sex, age group, other 
relevant variables. 

Sampling unit. 

(In two-stage or multi-stage 
sampling, unit at all stages) 
Number of persons or units from 
whom data actually collected. 
Number of cases of incomplete 
information (drop-outs). 

Number of cases rejected or kept 


out of analysis due to other reasons 


and justification thereof. 
Difficulties or problems or reasons 
of non-cooperation of subjects in 
giving information. 


clearly 
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Not 
at all 
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SELF SCORING : REPORTING 


Section B 


You may allot the numerical weights to each of your answers as 
follows : 


Very clearly Somewhat clearly Not at all 
2 1 0 


То obtain the total score on "Reporting Stage" add up these scores 


over all the 12 items. You may then read the following qualitative 
description about your reporting. 


Very Good Good Average Poor Very Poor 
Reporting Reporting Reporting ^ Reporting Reporting 
2landabove 15-19 10-14 5-9 0-4 


An Overall Assessment 

For an overall self-assessment of your sampling, use the following 
criteria. This includes section A and section B of the Scale. Add your 
scores on Scction A and section B and then consult the following table : 


Very Good Good Average Poor Very 


Poor 
Total Score :41 and above 31-40 21-30 11-20 0-10 


ADDITIONAL SCALES FOR GENERAL GUIDANCE ONLY 
(No Scoring Suggested) 


Section C 

Systematic Sampling 

1. If using systematic sampling did you ensure 
that there was no periodic effect or trend 


in the population or sampling frame ? Sese No- 
2. If periodic effect was visible, did you arrange 
the sampling frame or stratify и? Yes No. 


3. Did you use the middle value or the randomly 
selected value from the first K elements as the 
starting point? Yes — No 
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4. After obtaining the sample, did you give а proper 
check by inspection whether it looked 


representative of the population ? Yes — No 
Others 
5. Іп using clusters, did you use all the elements 
found in the selected clusters ? Yes № 
6. In multi-stage sampling, did you use T 
randomization at all stages ? ves No 
Section D Execution Stage 
The following points will be enlightening to the student. (No scoring 
suggested). 
1. Have you been able to obtain data from all the 
elements included in the initial sample ? Yes= = Nox 


2. Were the persons contacted at the time of 
data collection, in possession of the 


information sought for? Yes Ко . 
3. Did you make an all-out-effort to contact 

non-respondents and obtain data from them ? Yes Хо. 
4. Did you substitute non-respondents by an equal 

group added through random selection ? Yele No 
5. Did you obtain information from a small group 

selected randomly out of non-respondents and 

compared the characteristics of non-respondents 

Yes No 


with those of respondents ? 
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Post-stratification : Stratification which is carried out after the sample has 
been actually drawn or data collected. 


Quasi-Periodicity : When period effect is visible only at some values 
of n. 


Random Numbers : Numbers arranged through random selection. 


Random Selection : Selection of units based on equal chance of selection 
to each unit. 


Replicated Sampling : Sampling technique which uses a number of 
independent, self contained and adequate samples from the samc 
population; useful for estimating sampling error. 


Sample :A smaller portion taken, through some selected procedures, 
from a larger whole. 


Sample conditioning : Sample sensitization due to repeated exposures. 


Sample Mortality : Dropping out of the subjects from a sample on 
subsequent occasions of data collection or experiment. 


Sampling Design : A broad strategy used for sampling; generally 
classified as probability, nonprobability and mixed. 


Sampling Distribution : The distribution generated by a large number of 
statistics taken from a large number of random samples. 


Sampling Error : The sampling variability generally accounted for and 

taken care of by probability theory. 

Sampled Population : The population which has actually been sampled. 
(Sce target population for a distinction). 


Selection Bias: It signifies that sample, for whatever reasons has not been 
drawn according to pre-arranged specifications, 
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Simple Random Sampling : Random selection done from the list of 
individual units and not of groups. 


With replacement: Random selection in which the unit drawn each time 
is not put back in the aggregate so that the size of the aggregate 
remains constant at each draw of a unit. 


Without replacement : Random selection in which the unit drawn each 
time is put back in the aggregate leading to the decrease by one unit 


in the size of the aggregate every time. 


Sine Curve : A curve which changes in magnitude continuously and in 
direction, periodically. 


Situation Sampling : The subjects are observed sélectively in situations 
where the probability of target behaviour is high. 


Statistical Samples : Samples based on probability sampling techniques. 


Supply : See population. 


are selected from the list or frame 


Systematic Sampling : When units 
ry 8th or every 10th unit etc. 


through regular intervals as eve 


ich we intended to sample or over 


Target Population : The population wh 
as intended. (See experimentally 


which generalization of results w 
accessible population for a distinction). 


the result of the functional 


Technical Bias : A statistical effect which is 
rall possible samples to the 


form of the estimator not averaging ove 
true population value. 


record of frequency of observable forms of 


Time Sampling: A 
f definite time intervals 


occurrences during a number О 
systematically spaced is made. 


Universe : See Population. 


APPENDIX V 


Table of t, for use in determining the significance 
of statistics 


Example: When the df are 35 and 1—2.03, the .05 in column 
3 means that 5 times in 100 trials a divergence as large as that 


obtained may be expected in the positive and negative directions 
under the null hypothesis. 


Degrees of Probability (P) 

Freedom 0.10 0.05 0.02 0.01 
a) (2) (3) (4) (5) 
1 1=6.34 t=12.71 1=31.82 1=63.66 
2 2.92 4.30 6.96 9.92 
3 2:35 3.18 4.54 5.84 
4 2.13 2.78 3-15 4.60 
5 2.02 2.51 3.36 4.03 
6 1.94 2.45 3.14 3.71 
7 1.90 2.36 3.00 3.50 
8 1.86 2.31 2.90 3.36 
9 1.83 2.26 2.82 3.25 
10 1.81 2.23 2.76 3.17 
11 1.80 2.20 2.72 3.1 
12 1.78 2.18 2.68 3.06 
13 1.77 2.16 2.65 3.01 
14 1.76 2.14 2.62 2.98 
15 1.75 2.13 2.60 2.95 
16 1.75 2.12 2.58 2.92 
17 1.74 2.11 2.57 2.90 
18 1.73 2.10 2.55 2.88 


19 1.73 2.09 2.54 2.86 
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(1) (2) (3) 
20 1.72 2.09 
21 1.72 2.08 
22 1.72 2.07 
23 1.71 2.07 
24 1.71 2.06 
25 1.71 2.06 
26 1.71 2.06 
27 1.70 2.05 
28 1.70 2.05 
29 1.70 2.04 
30 1.70 2.04 
35 1.69 2.03 
40 1.68 2.02 
45 1.68 2.02 
50 1.68 2.01 
60 1.67 2.00 
70 1.67 2.00 
80 1.66 1.99 
90 1.66 1.99 
100 1.66 1.98 
125 1.66 1.98 
150 1.66 1.98 
200 1.65 1.97 
300 1.65 1.97 
400 1.65 1.97 
500 1.65 1.96 
1000 1.65 1.96 
1.65 1.96 


Fractional parts of the total area (taken as 0.1000) under the normal probability curve, 
corresponding to distances on the base-line between the mean and successive 
points laid off from the mean in units of standard deviation 


———Ó— — — S——— Ч ee i o: qe E =... 
Example: Between the mean and a point 1.38 c (=з ) are found 41.62% of the entire area 


under the curve. 


[441 


po .00 .01 .02 03 04 05 06 .07 .08 09 

0.0 0000 0040 0080 0120 0160 0199 0239 0279 0319 0359 
0.1 0398 0438 0478 0517 0557 0596 0636 0675 0714 0753 
0.2 0793 0832 0871 0910 0948 0987 1026 1064 1103 1141 
0.3 1179 1217 1245 1293 1331 1368 1406 1443 1480 1517 
0.4 1554 1591 1628 1664 1700 1736 1772 1808 1844 1879 
0.5 1915 1950 1985 2019 2054 2088 2123 2157 2190 2224 
0.6 2257 2291 2324 2357 2389 2422 2454 2486 2517 2549 


uornjvag puo sanbruu22 ү ‘sidavuop : дии фио5 anag 


0.7 
0.8 
0.9 
1.0 
pI 
1.2 
1.3 
1.4 
13 
1.6 
л 
1.8 
19 


2.0 
21 

2:2 
2:3 
2.4 
2.5 


2580 
2881 
3159 
3413 
3643 
3849 
4032 
4192 
4332 
4452 
4554 
4641 
4113 


4712 
4821 
4861 
4893 
4918 
4938 


2611 
2910 
3186 
3438 
3665 
3869 
4049 
4207 
4345 
4463 
4564 
4649 
4719 


4778 
4826 
4864 
4896 
4920 
4940 


2673 
2967 
3238 
3485 
3708 
3907 
4082 
4236 
4370 
4484 
4582 
4664 
4732 


4788 
4834 
4871 
4901 
4925 
4943 


2104 
2995 
3264 
3508 
3729 
3925 
4099 
4251 
4383 
4495 
4591 
4671 
4738 


4793 
4838 
4875 
4904 
4927 
4945 


2734 
3023 
3290 
3531 
3749 
3944 
4115 
4265 
4394 
4505 
4599 
4678 
4744 


4798 
4842 
4878 
4906 
4929 
4946 


2164 
3051 
3315 
3554 
3770 
3962 
4131 
4279 
4406 
4515 
4608 
4686 
4750 


4803 
4846 
4881 
4909 
4931 
4948 


2794 
3078 
3340 
3577 
3790 
3950 
4147 
4292 
4418 
4525 
4616 
4693 
4756 


4808 
4850 
4884 
4911 
4932 
4949 


2.6 
27 
2.8 
2.9 
3.0 
3.1 
3.2 
3,3 
3.4 
3.5 
3.6 
3.7 
3.8 
3:9 
4.0 
4.5 
5.0 


4953 
4965 
4974 
4981 
4986.5 
4990.3 
4993.129 
4995.166 
4996.631 
4997.674 
4998.409 
4998.922 
4999.277 
4999.519 
4999.683 
4999.966 


4955 
4966 
4975 
4982 
4986.9 
4990.6 


4999.997133 


4956 
4967 
4976 
4982 
4987.4 
4991.0 


4957 
4968 
4977 
4983 
4987.8 
49913 


4959 
4969 
4977 
4984 
4988.2 
4991.6 


4960 
4970 
4978 
4984 
4988.6 
4991.8 


4961 
4971 
4979 
4985 
4988.9 
4992.1 


4962 
4972 
4979 
4985 


4989.3 4989.7 
4992.4 4992.6 


4963 
4973 
4980 
4986 


4964 
4974 
4981 
4986 
4990.0 
4992.9 


TABLE OF RANDOM NU У 
MBERS 
(8,000 NUMBERS) Aar 


First Thousand 


13-16 


21-24 


25-28 | 29-32 


31-36 | 37-40 


Source: 
Numbers. 


83 72 
53 74| 35 98 
зо 43| 62 23 
9517 
75 53 
3393 
9164 
26 80| 4491 
42 98| 41 87 
94 82| 15 69 
62 71| 26 07 
86 38| 03 07 
17 36 


76 24 
go 61 
5005 
78 80 
94 25 
32 79 
93 оо 
5575 
69 53 
4195 
об 55 
52 55 
63 14 
3157 
90 52 
зо 72 


97 o8 
18 37 
10 03 
59 01 
12 58 


133% 
уо 04 
11 89 
82 96 
96 86 


8453 
$161 
89 51 
97 64 
95 65 
97 57 
o8 25 
16 68 


L- 


86 95 
44 10 
21 11 
94 12 
4054 
15 аа 
99 43 
12 58 
61 77 
79 45 


44 67 
48 89 
13 35 
48 62 
46 15 
56 99 
2122 
26 95 
3471 
66 99 


44 22 
89 88 
62 84 
67 65 
67 42 


1303 
96 22 
54 38 
4287 
88 21 
64 70 
64 07 
47 55 
73 
27 


33 
74 
oi 
5848 
06 $3 
29 82 
$3 26 
99 64 
68 75 
5361: 
03 42 
43 81 
91 95 
or 71 
24 $2 


67 44 
1343 
o8 34 
16 95 
9513 
68 50 
49 36 
257! 


Во| 95 
48) 28 
84| 53 
291 46 
74| 69 


27 
8o 
зо 
47 
93 


69 19 
ai 54 
76 5o 
15 87 
45 69 
18 67 
91 78 


Reproduced from M. 
Tracts for Computers ~ 


Kendall and 
XIV (London: 


B. Smith, 


В. rm 
Cambridge Unive 


Tabies 


of Random Sampling 


sity Press. 1 


939). pp. 2-5. 


RANDOM NUMBERS 


17-20 | 21-24 
4733|5127 


59 45 | 40 64 
13 14| 54 58 
58 43|24 61 
3o 86| 8o 74 
04 66|43 67 
57 46| 58 33 
95 19 | 27 38 


о очо машм 


16 бо | 68 до 
98 оо | 82 83 
та 48|36 83 
Ва 04 | 87 99 


за 42 | 72 63 
78 68 | 89 13 
35 86| оз 17 
86 12| 98 o8 
61 29178 49 


18 65 

за 04 

55 97| 96 75 

75 26| 96 45 
Fourth Thousand 


||. Г а [оча [чё aro [20-24 [oes [aaa [ae ee] 
40 8a 


94 43 | 93 64 
62 78 | 37 01 


69 92 | 40 79 
48 12 | 35 36 
43 07 | 07 22 


37 59 | 3431 
59 59177 46 
56 18 | o1 46 
40 81|47 54 
49 77 | о5 от 


33 93 | 36 60 
51 72 | 65 90 
74 87 | 88 оз 
57 10| 28 75 
бо 79| as 53 
41 96| 86 то 
66 71| 76 45 
29 94 
59 36 
31 77 
4795 
3115 
26 51 
60 14 17 79 
72 79 24 36| 87 9o 


RANDOM NUMBERS 


Fifth Thousand 


17-20 21-24 | 35-25 29-32 |33-36 | 37-49 
25 79 | то 47 | 88 93 | 79 61 

77 87 |75 51| от 31 | 03 42 

76 29 | 22 03| 99 41 | 24 38 

5a 69 | 65 95 | 92 об | 40 14 

44 28 | 64 81| 95 23 | 14 48 

98 ог | 68 53| 13 91 | 98 38 

97 77 | о9 59| 94 35 | 69 84 

13 51 | 80 13| 07 37 | 52 21 | 53 91 

72 09 | 92 58| 79 20 | 53 41 |02 18 | оо 64 
38 62 | 35 54| 14 44 | 66 88 |89 47| 41 Во 


99 82 | до 30| 87 80| 07 51 58 71 | 66 58 
14 68| 59 45| 51 87 | 56 о8 | до 8o| 66 бо 
59 28 | 44 12| 42 50 | 88 31 | 13 77 16 14 
44 04|64 97| 94 14 | ба 18 | 15 59] 83 35 
оз 71 | го 00| 99 86| 48 17 | 64 o6| 89 09 
78 70 | 45 97| 59 46| 62 21 | 17 25 06 20 
37 29 15 75| 93 96 | 91 76| 96 99 
34 48 23 23 | 94 бо | 88 79| об 17 
71 47 43 43| 55 81 | 19 45| 44 97 
2575 76 89| 18 os | 73 95| 72 22 
16 09 98 28 | 87 40 | 66 66] 66 92 
24 Ва 39 61| ба 47 | 49 11| 72 64 
76 19 91 a3| зо 14|63 28| 86 59 
20 07 33 84| оз 76 45 03) 33 33 
74 27 3145] 65 82 | 86 о5| 73 оо 


Bixth Thousand 

87 бо os 67| 88 15 |47 00 
66 o1 
63 26 
92 12 
92 42 
48 74 
84 20 
92 85 
65 42 
15 64 


27 72 
29 09 
то 19 
3312 
1794 
9з 85 
3975 
73 96 
оо 25 
07 98 
58 79 
91 16 
51 20 
за 44 
66 92 


RANDOM NUMBERS 


Seventh Thousand 


e te 


-- 
wa 


RET 
$3858 520 


sae 
ass 


17-20 


21- 


25-28 


36 90 
26 o8 
5132 
23 20 
74 76 
85 74 
10 30 
98 84 
74 23 
89 67 
93 86 
29 33 
4502 
66 13 
47 82 
3103 
23 88 
92 96 
35 45 
973! 


85 20 
54 45 
66 55 
26 о4 
86 55 


93 
42 
36 
74 
86 


44 
84 
95 
53 
22 
88 
29 


24 

91 

оо 

10 
59 
o6 
ЕН 
38 
04 
91 
81 


59 
19 
19 
оо 
59 
37 
22 
or 
08 
27 
23 
97 
36 
20 
72 


69 
зї 
23 
зо 
39 
86 
об 
75 
27 
94 
69 
50 
ot 
95 
19 


33 
73 
74 
зо 
29 
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Accessible population 15-16 Incidental sampling 69-70 
Accidental sampling 69 merits 69 
Area probability sampling 61 Inferential statistics 16 


Area sampling 61 
Judgement sampling 116 


Bailey, K.D. 50 
Bias in sampling 21-23, 116 Kalton, С. 45, 50, 63, 67, 73, 80, 83, 97 

defined 21-22 Kempthorne, O. 32 

reasons 22 Kendall, M.G. 21, 32, 42, 43, 109 
Bracht, С.Н. 32 Kish, L. 41, 43, 56, 58 

Known population 16 

CAT 17 
Central limit theorem 29 Lamarkshire milk experiment 25 
Cluster 60, 116 Law of large numbers 28, 117 


Cluster sampling 27, 60-61, 116 Longitudinal studies 82-86, 117 


Cochran, W.E. 32, 41, 42, 45, 50, 56, 58 


Comfield, J. 32 Madow, L.H. 50, 70, 80 


Mahalonobis, P.C. 66, 67 
Marks, E.S. 23, 32 

Master samples 86-87, 117 
Measurement bias 117 
Measurement errors 22-24 


Milne, A. 51 
Moser, C.A. 45, 50, 63, 67, 73, 80, 83, 


Demming, W.E. 21, 32, 67 
Dimensional sampling 77-78 


Estimator 23, 28, 29, 116 


Event sampling 95, 116 
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Experimentally accessible popu 
16, 116 97 
Experimental unit 30, 116 Mouly, G.J. 32 
Expert choice sampling 70-71 Movers 117 
Multistage sampling 61-64, 117 
Finney DJ. 51 distinguished from multistage 62 
Frame 116 two-phase 63 
Multistage sampling 62-63, 117 
Glass, G.V. 32 
Goode, W.J. 32 Neyman, J. 56, 58 
Goodman R. 57, 58 ч New-probability sampling 
methods 69-81 
Hansen, M.H. 71, 80 Non-response 87-93 
Нап, Р.К. 32 Non-sampling errors 117 
: Normal distribution 29 


Hurwitz, W.N. 71, 80 
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RESEARCH IN EMERGING FIELDS OF EDU 
Concepts, Trends and Prospects: 


The discipline of education has recently witnessed ‘ah exp osion , 


of knowledge. The multidisciplinary approach to the solution 
of educational problems has brought into focus several пем 
fields of research. Among these are: EnVironmental Education, 
Population Education, Bio-energy Education, Yoga Education— 
concepts that have been formulated and clarified. A good deal 
of research material has already been gathered. 


The present book, comprising 22 papers by contributors who 
are experts in their own specialities, presents to students, 
researchers, teachers and research supervisors, the basic 
concepts, the present status of research and future directions 
in some of the emerging fields of education. The book will 
serve the purpose of resource-cum-reference-cum-supple- 
mentary reading. 


STATISTICAL METHODS: Concepts, 
Applications and Computation 


The book presents a simple treatment of statistical methods 
as applied to education, psychology, sociology, economics, 


"commerce, management, social work and other social sciences. 


The main emphasis is on generating an understanding of 
concepts, applications and procedures involved in statistical 
analysis of experimental and observational data. It is the result 
of the author's long experience of teaching post-graduate 
classes. 

All important parametric and non-parametric tests have been 
presented in a manner that would be convenient evan to the 
non-mathematical mind. The student can easily grasp the 
procedures right from the setting up of hypotheses to the 
interpretation of results. 


The second edition is an improved one with all printing, and 
computational errors removed. The diagrams have also been 
improved. 
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